MODIFIED CEA NUCLEIC ACID AND EXPRESSION VECTORS 
FIELD OF THE INVENTION 

The present invention relates to a nucleic acid encoding a polypeptide and the use of 
the nucleic acid or polypeptide in preventing and / or treating cancer. In particular, the 
5 invention relates to improved vectors for the insertion and expression of foreign genes 
encoding tumor antigens for use in immunotherapeutic treatment of cancer. 

BACKGROUND OF THE INVENTION 

There has been tremendous increase in last few years in the development of cancer 

10 vaccines with Tumour-associated antigens (TAAs) due to the great advances in identification 
of molecules based on the expression profiling on primary tumours and normal cells with the 
help of several techniques such as high density microarray, SEREX, immunohistochemistry 
(IHC), RT-PCR, in-situ hybridization (ISH) and laser capture microscopy (Rosenberg, 
Immunity, 1999; Sgroi et al, 1999, Schena et al, 1995, Offringa et al, 2000). The TAAs are 

15 antigens expressed or over-expressed by tumour cells and could be specific to one or several 
tumours for example CEA antigen is expressed in colorectal, breast and lung cancers. Sgroi et 
al (1999) identified several genes differentially expressed in invasive and metastatic 
carcinoma cells with combined use of laser capture microdissection and cDNA microarrays. 
Several delivery systems like DNA or viruses could be used for therapeutic vaccination 

20 against human cancers (Bonnet et al, 2000) and can elicit immune responses and also break 
immune tolerance against TAAs. Tumour cells can be rendered more immunogenic by 
inserting transgenes encoding T cell co-stimulatory molecules such as B7.1 or cytokines 
IFNgamma, IL2, GM-CSF etc. Co-expression of a TAA and a cytokine or a co-stimulatory 
molecule can develop effective therapeutic vaccine (Hodge et al, 95, Bronte et al, 1995, 

25 Chamberlain et al, 1 996). 

There is a need in the art for reagents and methodologies useful in stimulating an 
immune response to prevent or treat cancers. The present inventions provides such reagents 
and methodologies which overcome many of the difficulties encountered by others in 
attempting to treat cancers such as cancer. In particular, the present invention provides a 

30 novel coding sequence for CEA. This nucleotide sequence, CEA(6D)-1 ,2, includes sequence 
modifications that eliminate the expression of truncated forms of CEA as expressed from 
expression vectors. Such a modified sequence is desired by those of skill in the art to 
improve expression and immunization protocols for CEA. 



SUMMARY OF THE INVENTION 

The present invention provides an immunogenic target for administration to a patient 
to prevent and / or treat cancer. In particular, the immunogenic target is a CEA tumor antigen 

5 ("TA") and / or an angiogenesis-associated antigen ("AA"). In one embodiment, the 
immungenic target is encoded by a modified CEA nucleotide sequence (CEA(6D)-1,2) that 
improves CEA expression in transfected cells. In certain embodiments, the TA and / or AA 
are administered to a patient as a nucleic acid contained within a plasmid or other delivery 
vector, such as a recombinant virus. The TA and / or AA may also be administered in 

10 combination with an immune stimulator, such as a co-stimulatory molecule or adjuvant. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. A. Illustration of plasmid p3'H6MCEA comprising the CEA coding sequence 
with the 6D modification under the control of partial H6 promoter. B. Illustration of plasmid 
15 pSE 1 544.9 (pUC 1 8-mCEA repeat 1 ). 

Figure 2. Illustration of plasmid pSE16!6.44 (pUC18-mCEA-modified repeat 1). 
Figure 3. Illustration of plasmid pSE1658.15 (p3'H6MCEA-modified repeat 1). 
Figure 4. Illustration of plasmid pBSmCEA. 

Figure 5. Illustration of plasmid pSEl 686.1 (pUC18 mCEA modified repeat 2. 
20 Figure 6. Illustration of plasmid pSE1696.1 (pUC18 mCEA modified repeat 2. 
Figure 7. Illustration of plasmid p3 , H6modMCEA-1st&2nd repeats. 
Figure 8. Illustration of plasmid pNVQH6MCEA(6Dlst&2 nd ). 

Figure 9A-D. Comparison of nucleotide sequence of CAP(6D) and CAP(6D)-1,2. 
Differences between the sequences are underlined. 
25 Figure 10. PCR analysis to confirm the presence of CAP(6D)-1,2 in NYVAC DNA. 

Figure 1 1. Immunoblot illustrating the lack of truncated CEA in cells expressing CAP(6D)- 
1,2. 

30 

DETAILED DESCRIPTION 

The present invention provides reagents and methodologies useful for treating and / or 
preventing cancer. All references cited within this application are incorporated by reference. 
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In one embodiment, the present invention relates to the induction or enhancement of 
an immune response against one or more tumor antigens ("TA") to prevent and / or treat 
cancer. In certain embodiments, one or more TAs may be combined. In preferred 
embodiments, the immune response results from expression of a TA in a host cell following 

5 administration of a nucleic acid vector encoding the tumor antigen or the tumor antigen itself 
in the form of a peptide or polypeptide, for example. 

As used herein, an "antigen" is a molecule (such as a polypeptide) or a portion thereof 
that produces an immune response in a host to whom the antigen has been administered. The 
immune response may include the production of antibodies that bind to at least one epitope of 

10 the antigen and / or the generation of a cellular immune response against cells expressing an 
epitope of the antigen. The response may be an enhancement of a current immune response 
by, for example, causing increased antibody production, production of antibodies with 
increased affinity for the antigen, or an increased cellular response (i.e., increased T cells). 
An antigen that produces an immune response may alternatively be referred to as being 

15 immunogenic or as an immunogen. In describing the present invention, a TA may be 
referred to as an "immunogenic target". 

TA includes both tumor-associated antigens (TAAs) and tumor-specific antigens 
(TSAs), where a cancerous cell is the source of the antigen. A TAA is an antigen that is 
expressed on the surface of a tumor cell in higher amounts than is observed on normal cells 

20 or an antigen that is expressed on normal cells during fetal development. A TSA is an 
antigen that is unique to tumor cells and is not expressed on normal cells. TA further 
includes TAAs or TSAs, antigenic fragments thereof, and modified versions that retain their 
antigenicity. 

TAs are typically classified into five categories according to their expression pattern, 
25 function, or genetic origin: cancer-testis (CT) antigens (i.e., MAGE, NY-ESO-1); melanocyte 
differentiation antigens (i.e., Melan A/MART-1, tyrosinase, gplOO); mutational antigens (i.e., 
MUC-1, p53, CDK-4); overexpressed 'self antigens (i.e., HER-2/neu, p53); and, viral 
antigens (i.e., HPV, EBV). For the purposes of practicing the present invention, a suitable 
TA is any TA that induces or enhances an anti-tumor immune response in a host to whom the 
30 TA has been administered. Suitable TAs include, for example, gplOO (Cox et al., Science, 
264:716-719 (1994)), MART- 1 /Melan A (Kawakami et al., J. Exp. Med, 180:347-352 
(1994)), gp75 (TRP-1) (Wang et al., J. Exp. Med., 186:1 131-1 140 (1996)), tyrosinase (Wolfel 
et al., Eur. J. Immunol., 24:759-764 (1994); WO 200175117; WO 200175016; WO 
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200175007), NY-ESO-1 (WO 98/14464; WO 99/18206), melanoma proteoglycan (Hellstrom 
et al., J. Immunol., 130:1467-1472 (1983)), MAGE family antigens (i.e., MAGE-1, 
2,3,4,6,12, 51; Van der Bruggen et al., Science, 254:1643-1647 (1991); U.S. Pat. Nos. 
6,235,525; CN 131961 1), BAGE family antigens (Boel et al., Immunity, 2:167-175 (1995)), 

5 GAGE family antigens (i.e., GAGE-1,2; Van den Eynde et a!., J. Exp. Med., 182:689-698 
(1995); U.S. Pat. No. 6,013,765), RAGE family antigens (i.e., RAGE-1; Gaugler et at., 
Immunogenetics, 44:323-330 (1996); U.S. Pat. No. 5,939,526), N- 
acetylglucosaminyltransferase-V (Guilloux et at., J. Exp. Med., 183:1173-1183 (1996)), pl5 
(Robbins et al., J. Immunol. 154:5944-5950 (1995)), B-catenin (Robbins et al., J. Exp. Med., 

10 183:1185-1192 (1996)), MUM-1 (Coulie et al., Proc. Natl. Acad. Sci. USA, 92:7976-7980 
(1995)), cyclin dependent kinase-4 (CDK4) (Wolfel et al., Science, 269:1281-1284 (1995)), 
p21-ras (Fossum et at., Int. J. Cancer, 56:40-45 (1994)), BCR-abl (Bocchia et al., Blood, 
85:2680-2684 (1995)), p53 (Theobald et al., Proc. Natl. Acad. Sci. USA, 92:11993-11997 
(1995)), pl85 HER2/neu (erb-Bl; Fisk et al., J. Exp. Med., 181:2109-2117 (1995)), 

15 epidermal growth factor receptor (EGFR) (Harris et al., Breast Cancer Res. Treat, 29:1-2 

(1994) ), carcinoembryonic antigens (CEA) (Kwong et al., J. Natl. Cancer Inst., 85:982-990 

(1995) U.S. Pat. Nos. 5,756,103; 5,274,087; 5,571,710; 6,071,716; 5,698,530; 6,045,802; EP 
263933; EP 346710; and, EP 784483); carcinoma-associated mutated mucins (i.e., MUC-1 
gene products; Jerome et al., J. Immunol., 151:1654-1662 (1993)); EBNA gene products of 

20 EBV (i.e., EBNA-1; Rickinson et al., Cancer Surveys, 13:53-80 (1992)); E7, E6 proteins of 
human papillomavirus (Ressing et al., J. Immunol, 154:5934-5943 (1995)); prostate specific 
antigen (PSA; Xue et al., The Prostate, 30:73-78 (1997)); prostate specific membrane 
antigen (PSMA; Israeli, et al., Cancer Res., 54:1807-1811 (1994)); idiotypic epitopes or 
antigens, for example, immunoglobulin idiotypes or T cell receptor idiotypes (Chen et al., J. 

25 Immunol., 153:4775-4787 (1994)); KSA (U.S. Patent No. 5,348,887), kinesin 2 (Dietz, et al. 
Biochem Biophys Res Commun 2000 Sep 7;275(3):731-8), HIP-55, TGFP-1 anti-apoptotic 
factor (Toomey, et al. Br J Biomed Sci 200 1;58(3): 177-83), tumor protein D52 (Bryne J. A., 
et al., Genomics, 35:523-532 (1996)), HI FT, NY-BR-1 (WO 01/47959), NY-BR-62, NY- 
BR-75, NY-BR-85, NY-BR-87, NY-BR-96 (Scanlan, M. Serologic and Bioinformatic 

30 Approaches to the Identification of Human Tumor Antigens, in Cancer Vaccines 2000, 
Cancer Research Institute, New York, NY), AAC2-1, or AAC2-2, including "wild-type" (i.e., 
normally encoded by the genome, naturally-occurring), modified, and mutated versions as 
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well as other fragments and derivatives thereof. Any of these TAs may be utilized alone or in 
combination with one another in a co-immunization protocol. 

In certain cases, it may be beneficial to co-immunize patients with both TA and other 
antigens, such as angiogenesis-associated antigens ("AA"). An AA is an immunogenic 

5 molecule (i.e., peptide, polypeptide) associated with cells involved in the induction and / or 
continued development of blood vessels. For example, an AA may be expressed on an 
endothelial cell ("EC"), which is a primary structural component of blood vessels. Where the 
cancer is cancer, it is preferred that that the AA be found within or near blood vessels that 
supply a tumor. Immunization of a patient against an AA preferably results in an anti-AA 

10 immune response whereby angiogenic processes that occur near or within tumors are 
prevented and / or inhibited. 

Exemplary AAs include, for example, vascular endothelial growth factor (i.e., VEGF; 
Bernardini, et al. J. Urol., 2001, 166(4): 1275-9; Starnes, et al. J. Thorac. Cardiovasc. Surg., 
2001, 122(3): 518-23), the VEGF receptor (i.e., VEGF-R, flk-l/KDR; Starnes, et al. J. 

15 Thorac. Cardiovasc. Surg., 2001 , 122(3): 51 8-23), EPH receptors (i.e., EPHA2; Gerety, et al. 
1999, Cell, 4: 403-414), epidermal growth factor receptor (i.e., EGFR; Ciardeillo, et al. Clin. 
Cancer Res., 2001, 7(10): 2958-70), basic fibroblast growth factor (i.e., bFGF; Davidson, et 
al. Clin. Exp. Metastasis 2000,18(6): 501-7; Poon, et al. Am J. Surg., 2001, 1 82(3):298-304), 
platelet-derived cell growth factor (i.e., PDGF-B), platelet-derived endothelial cell growth 

20 factor (PD-ECGF; Hong, et al. J. Mol. Med., 2001, 8(2):141-8), transforming growth factors 
(i.e., TGF-ot; Hong, et al. J. Mol. Med., 2001, 8(2):141-8), endoglin (Balza, et al. Int. J. 
Cancer, 2001, 94: 579-585), Id proteins (Benezra, R. Trends Cardiovasc. Med., 2001, 
1 1(6):237-41), proteases such as uPA, uPAR, and matrix metal loproteinases (MMP-2, MMP- 
9; Djonov, et al. J. Pathol., 2001, 195(2): 147-55), nitric oxide synthase (Am. J. Ophthalmol., 

25 2001, !32(4):551-6), aminopeptidase (Rouslhati, E. Nature Cancer, 2: 84-90, 2002), 
thrombospondins (i.e., TSP-1, TSP-2; Alvarez, et al. Gynecol. Oncol., 2001, 82(2):273-8; 
Seki, et al. Int. J. Oncol., 2001, 19(2):305-10), k-ras (Zhang, et al. Cancer Res., 2001, 
61(16):6050-4), Wnt (Zhang, et al. Cancer Res., 2001, 61(16):6050-4), cyclin-dependent 
kinases (CDKs; Drug Resist. Updat. 2000, 3(2):83-88), microtubules (Timar, et al. 2001. 

30 Path. Oncol Res., 7(2): 85-94), heat shock proteins (i.e., HSP90 (Timar, supra)), heparin- 
binding factors (i.e., heparinase; Gohji, et al. Int. J. Cancer, 2001, 95(5):295-301), synthases 
(i.e., ATP synthase, thymidilate synthase), collagen receptors, integrins (i.e., aop3, aup5, 
alpl, <x2pi, a5pl), the surface proteoglycan NG2, AAC2-1 (SEQ ID NO.:l), or AAC2-2 
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(SEQ ID N0.:2), among others, including "wild-type" (i.e., normally encoded by the 
genome, naturally-occurring), modified, mutated versions as well as other fragments and 
derivatives thereof. Any of these targets may be suitable in practicing the present invention, 
either alone or in combination with one another or with other agents. 

5 In certain embodiments, a nucleic acid molecule encoding an immunogenic target is 

utilized. The nucleic acid molecule may comprise or consist of a nucleotide sequence 
encoding one or more immunogenic targets, or fragments or derivatives thereof, such as that 
contained in a DNA insert in an ATCC Deposit. The term "nucleic acid sequence" or 
"nucleic acid molecule" refers to a DNA or RNA sequence. The term encompasses 

10 molecules formed from any of the known base analogs of DNA and RNA such as, but not 
limited to 4-acetylcytosine, 8-hydroxy-N6-methyladenosine, aziridinyl-cytosine, 
pseudoisocytosine, 5-(carboxyhydroxylmethyl) uracil, 5-fluorouracil, 5-bromouracil, 5- 
carboxymethylaminomethyl-2-thiouracil, 5-carboxy-methylaminomethyluracil, 
dihydrouracil, inosine, N6-iso-pentenyladenine, 1-methyladenine, 1-methylpseudouracil, 1- 

15 methylguanine, I -methyl inosine, 2,2-dimethyl-guanine, 2-methyladenine, 2-methylguanine, 
3-methylcytosine, 5-methylcytosine, N6-methy laden ine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyamino-methyl-2-thiouracil, beta-D-mannosylqueosine, 
5' -methoxycarbonyl-methyluracil, 5-methoxy uracil, 2-methylthio-N6-isopentenyladenine, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, oxybutoxosine, pseudouracil, 

20 queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5 -methyl uracil, N- 
uracil-5-oxyacetic acid methylester, uracil-5-oxy acetic acid, pseudouracil, queosine, 2- 
thiocytosine, and 2,6-diaminopurine, among others. 

An isolated nucleic acid molecule is one that: (1) is separated from at least about 50 
percent of proteins, lipids, carbohydrates, or other materials with which it is naturally found 

25 when total nucleic acid is isolated from the source cells; (2) is not be linked to all or a portion 
of a polynucleotide to which the nucleic acid molecule is linked in nature; (3) is operably 
linked to a polynucleotide which it is not linked to in nature; and / or, (4) does not occur in 
nature as part of a larger polynucleotide sequence. Preferably, the isolated nucleic acid 
molecule of the present invention is substantially free from any other contaminating nucleic 

30 acid molecule(s) or other contaminants that are found in its natural environment that would 
interfere with its use in polypeptide production or its therapeutic, diagnostic, prophylactic or 
research use. As used herein, the term "naturally occurring" or "native" or "naturally found" 
when used in connection with biological materials such as nucleic acid molecules, 



polypeptides, host cells, and the like, refers to materials which are found in nature and are not 
manipulated by man. Similarly, "non-natural ly occurring" or "non-native" as used herein 
refers to a material that is not found in nature or that has been structurally modified or 
synthesized by man. 

5 The identity of two or more nucleic acid or polypeptide molecules is determined by 

comparing the sequences. As known in the art, "identity" means the degree of sequence 
relatedness between nucleic acid molecules or polypeptides as determined by the match 
between the units making up the molecules (i.e., nucleotides or amino acid residues). Identity 
measures the percent of identical matches between the smaller of two or more sequences with 

10 gap alignments (if any) addressed by a particular mathematical model or computer program 
(i.e., an algorithm). Identity between nucleic acid sequences may also be determined by the 
ability of the related sequence to hybridize to the nucleic acid sequence or isolated nucleic 
acid molecule. In defining such sequences, the term "highly stringent conditions" and 
"moderately stringent conditions" refer to procedures that permit hybridization of nucleic 

15 acid strands whose sequences are complementary, and to exclude hybridization of 
significantly mismatched nucleic acids. Examples of "highly stringent conditions" for 
hybridization and washing are 0.015 M sodium chloride, 0.0015 M sodium citrate at 65-68°C 
or 0.015 M sodium chloride, 0.0015 M sodium citrate, and 50% formamide at 42°C. (see, for 
example, Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual (2nd ed., 

20 Cold Spring Harbor Laboratory, 1989); Anderson et al, Nucleic Acid Hybridisation: A 
Practical Approach Ch. 4 (IRL Press Limited)). The term "moderately stringent conditions" 
refers to conditions under which a DNA duplex with a greater degree of base pair 
mismatching than could occur under "highly stringent conditions" is able to form. 
Exemplary moderately stringent conditions are 0.015 M sodium chloride, 0.0015 M sodium 

25 citrate at 50-65°C or 0.015 M sodium chloride, 0.0015 M sodium citrate, and 20% formamide 
at 37-50°C. By way of example, moderately stringent conditions of 50°C in 0.015 M sodium 
ion will allow about a 21% mismatch. During hybridization, other agents may be included in 
the hybridization and washing buffers for the purpose of reducing non-specific and/or 
background hybridization. Examples are 0.1% bovine serum albumin, 0.1% polyvinyl- 

30 pyrrolidone, 0.1% sodium pyrophosphate, 0.1% sodium dodecyl sulfate, NaDodS0 4 , (SDS), 
ficoll, Denhardt's solution, sonicated salmon sperm DNA (or another non-complementary 
DNA), and dextran sulfate, although other suitable agents can also be used. The 
concentration and types of these additives can be changed without substantially affecting the 
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stringency of the hybridization conditions. Hybridization experiments are usually carried out 
at pH 6.8-7.4; however, at typical ionic strength conditions, the rate of hybridization is nearly 
independent of pH. 

In preferred embodiments of the present invention, vectors are used to transfer a 

5 nucleic acid sequence encoding a polypeptide to a cell. A vector is any molecule used to 
transfer a nucleic acid sequence to a host cell. In certain cases, an expression vector is 
utilized. An expression vector is a nucleic acid molecule that is suitable for transformation of 
a host cell and contains nucleic acid sequences that direct and / or control the expression of 
the transferred nucleic acid sequences. Expression includes, but is not limited to, processes 

10 such as transcription, translation, and splicing, if introns are present. Expression vectors 
typically comprise one or more flanking sequences operably linked to a heterologous nucleic 
acid sequence encoding a polypeptide. Flanking sequences may be homologous (i.e., from 
the same species and / or strain as the host cell), heterologous (i.e., from a species other than 
the host cell species or strain), hybrid (i.e., a combination of flanking sequences from more 

15 than one source), or synthetic, for example. 

A flanking sequence is preferably capable of effecting the replication, transcription 
and / or translation of the coding sequence and is operably linked to a coding sequence. As 
used herein, the term operably linked refers to a linkage of polynucleotide elements in a 
functional relationship. For instance, a promoter or enhancer is operably linked to a coding 

20 sequence if it affects the transcription of the coding sequence. However, a flanking sequence 
need not necessarily be contiguous with the coding sequence, so long as it functions 
correctly. Thus, for example, intervening untranslated yet transcribed sequences can be 
present between a promoter sequence and the coding sequence and the promoter sequence 
may still be considered operably linked to the coding sequence. Similarly, an enhancer 

25 sequence may be located upstream or downstream from the coding sequence and affect 
transcription of the sequence. 

In certain embodiments, it is preferred that the flanking sequence is a trascriptional 
regulatory region that drives high-level gene expression in the target cell. The transcriptional 
regulatory region may comprise, for example, a promoter, enhancer, silencer, repressor 

30 element, or combinations thereof. The transcriptional regulatory region may be either 
constitutive, tissue-specific, cell-type specific (i.e., the region is drives higher levels of 
transcription in a one type of tissue or cell as compared to another), or regulatable (i.e., 
responsive to interaction with a compound such as tetracycline). The source of a 



transcriptional regulatory region may be any prokaryotic or eukaryotic organism, any 
vertebrate or invertebrate organism, or any plant, provided that the flanking sequence 
functions in a cell by causing transcription of a nucleic acid within that cell. A wide variety 
of transcriptional regulatory regions may be utilized in practicing the present invention. 

5 Suitable transcriptional regulatory regions include the CMV promoter (i.e., the CMV- 

immediate early promoter); promoters from eukaryotic genes (i.e., the estrogen-inducible 
chicken ovalbumin gene, the interferon genes, the gluco-corticoid-inducible tyrosine 
aminotransferase gene, and the thymidine kinase gene); and the major early and late 
adenovirus gene promoters; the SV40 early promoter region (Bernoist and Chambon, 1981, 

10 Nature 290:304-10); the promoter contained in the 3' long terminal repeat (LTR) of Rous 
sarcoma virus (RSV) (Yamamoto, et al, 1980, Cell 22:787-97); the herpes simplex virus 
thymidine kinase (HSV-TK) promoter (Wagner et al, 1981, Proc. Natl. Acad. Sci. U.S.A. 
78:1444-45); the regulatory sequences of the metallothionine gene (Brinster et al, 1982, 
Nature 296:39-42); prokaryotic expression vectors such as the beta-lactamase promoter 

15 (Villa-KamarofTef ai, 1978, Proc. Natl. Acad ScL U.S.A., 75:3727-31); or the tac promoter 
(DeBoer et al, 1983, Proc. Natl. Acad Sci. U.S.A., 80:21-25). Tissue- and / or cell-type 
specific transcriptional control regions include, for example, the elastase 1 gene control region 
which is active in pancreatic acinar cells (Swift et al., 1984, Cell 38:639-46; Ornitz et al., 
1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409 (1986); MacDonald, 1987, 

20 Hepatology 7:425-515); the insulin gene control region which is active in pancreatic beta 
cells (Hanahan, 1985, Nature 3 15:1 15-22); the immunoglobulin gene control region which is 
active in lymphoid cells (Grosschedl et ai, 1984, Cell 38:647-58; Adames et al., 1985, 
Nature 318:533-38; Alexander et al., 1987, Mol. Cell. Biol, 7:1436-44); the mouse 
mammary tumor virus control region in testicular, breast, lymphoid and mast cells (Leder et 

25 ai, 1986, Cell 45:485-95); the albumin gene control region in liver (Pinkert et al, 1987, 
Genes and Devel. 1 :268-76); the alpha-feto-protein gene control region in liver (Krumlauf et 
al., 1985, Mot. Cell. Biol., 5:1639-48; Hammer et al., 1987, Science 235:53-58); the alpha 1- 
antitrypsin gene control region in liver (Kelsey et al., 1987, Genes and Devel. 1 : 161-71); the 
beta-globin gene control region in myeloid cells (Mogram et al., 1985, Nature 315:338-40; 

30 Kollias et al, 1986, Cell 46:89-94); the myelin basic protein gene control region in 
oligodendrocyte cells in the brain (Readhead et al, 1987, Cell 48:703-12); the myosin light 
chain-2 gene control region in skeletal muscle (Sani, 1985, Nature 314:283-86); the 
gonadotropic releasing hormone gene control region in the hypothalamus (Mason et al, 
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1986, Science 234:1372-78), and the tyrosinase promoter in melanoma cells (Hart, I. Semin 
Oncol 1996 Feb;23(l): 154-8; Siders, et al. Cancer Gene Ther 1998 Sep-Oct;5(5):28I-91), 
among others. Other suitable promoters are known in the art. 

As described above, enhancers may also be suitable flanking sequences. Enhancers 

5 are cis-acting elements of DNA, usually about 10-300 bp in length, that act on the promoter 
to increase transcription. Enhancers are typically orientation- and position-independent, 
having been identified both 5' and 3' to controlled coding sequences. Several enhancer 
sequences available from mammalian genes are known (i.e., globin, elastase, albumin, alpha- 
feto-protein and insulin). Similarly, the SV40 enhancer, the cytomegalovirus early promoter 

10 enhancer, the polyoma enhancer, and adenovirus enhancers are useful with eukaryotic 
promoter sequences. While an enhancer may be spliced into the vector at a position 5' or 3' 
to nucleic acid coding sequence, it is typically located at a site 5' from the promoter. Other 
suitable enhancers are known in the art, and would be applicable to the present invention. 

While preparing reagents of the present invention, cells may need to be transfected or 

15 transformed. Transfection refers to the uptake of foreign or exogenous DNA by a cell, and a 
cell has been transfected when the exogenous DNA has been introduced inside the cell 
membrane. A number of transfection techniques are well known in the art (i.e., Graham et 
al., 1973, Virology 52:456; Sambrook et al, Molecular Cloning, A Laboratory Manual (Cold 
Spring Harbor Laboratories, 1989); Davis et al., Basic Methods in Molecular Biology 

20 (Elsevier, 1986); and Chu et al., 1981, Gene 13:197). Such techniques can be used to 
introduce one or more exogenous DNA moieties into suitable host cells. 

In certain embodiments, it is preferred that transfection of a cell results in 
transformation of that cell. A cell is transformed when there is a change in a characteristic of 
the cell, being transformed when it has been modified to contain a new nucleic acid. 

25 Following transfection, the transfected nucleic acid may recombine with that of the cell by 
physically integrating into a chromosome of the cell, may be maintained transiently as an 
episomal element without being replicated, or may replicate independently as a plasmid. A 
cell is stably transformed when the nucleic acid is replicated with the division of the cell. 

The present invention further provides isolated immunogenic targets in polypeptide 

30 form. A polypeptide is considered isolated where it: (1) has been separated from at least 
about 50 percent of polynucleotides, lipids, carbohydrates, or other materials with which it is 
naturally found when isolated from the source cell; (2) is not linked (by covalent or 
noncovalent interaction) to all or a portion of a polypeptide to which the "isolated 

10 



polypeptide" is linked in nature; (3) is operably linked (by covalent or noncovalent 
interaction) to a polypeptide with which it is not linked in nature; or, (4) does not occur in 
nature. Preferably, the isolated polypeptide is substantially free from any other 
contaminating polypeptides or other contaminants that are found in its natural environment 

5 that would interfere with its therapeutic, diagnostic, prophylactic or research use. 

Immunogenic target polypeptides may be mature polypeptides, as defined herein, and 
may or may not have an amino terminal methionine residue, depending on the method by 
which they are prepared. Further contemplated are related polypeptides such as, for example, 
fragments, variants (i.e., allelic, splice), orthologs, homologues, and derivatives, for example, 

10 that possess at least one characteristic or activity (i.e., activity, antigenicity) of the 
immunogenic target. Also related are peptides, which refers to a series of contiguous amino 
acid residues having a sequence corresponding to at least a portion of the polypeptide from 
which its sequence is derived. In preferred embodiments, the peptide comprises about 5-10 
amino acids, 10-15 amino acids, 15-20 amino acids, 20-30 amino acids, or 30-50 amino 

15 acids. In a more preferred embodiment, a peptide comprises 9-12 amino acids, suitable for 
presentation upon Class 1 MHC molecules, for example. 

A fragment of a nucleic acid or polypeptide comprises a truncation of the sequence 
(i.e., nucleic acid or polypeptide) at the amino terminus (with or without a leader sequence) 
and / or the carboxy terminus. Fragments may also include variants (i.e., allelic, splice), 

20 orthologs, homologues, and other variants having one or more amino acid additions or 
substitutions or internal deletions as compared to the parental sequence. In preferred 
embodiments, truncations and/or deletions comprise about 10 amino acids, 20 amino acids, 
30 amino acids, 40 amino acids, 50 amino acids, or more. The polypeptide fragments so 
produced will comprise about 10 amino acids, 25 amino acids, 30 amino acids, 40 amino 

25 acids, 50 amino acids, 60 amino acids, 70 amino acids, or more. Such polypeptide fragments 
may optionally comprise an amino terminal methionine residue. It will be appreciated that 
such fragments can be used, for example, to generate antibodies or cellular immune responses 
to immunogenic target polypeptides. 

A variant is a sequence having one or more sequence substitutions, deletions, and/or 

30 additions as compared to the subject sequence. Variants may be naturally occurring or 
artificially constructed. Such variants may be prepared from the corresponding nucleic acid 
molecules. In preferred embodiments, the variants have from 1 to 3, or from 1 to 5, or from 1 
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to 10, or from 1 to 15, or from 1 to 20, or from 1 to 25, or from 1 to 30, or from 1 to 40, or 
from 1 to 50, or more than 50 amino acid substitutions, insertions, additions and/or deletions. 

An allelic variant is one of several possible naturally-occurring alternate forms of a 
gene occupying a given locus on a chromosome of an organism or a population of organisms. 

5 A splice variant is a polypeptide generated from one of several RNA transcript resulting from 
splicing of a primary transcript. An ortholog is a similar nucleic acid or polypeptide 
sequence from another species. For example, the mouse and human versions of an 
immunogenic target polypeptide may be considered orthologs of each other. A derivative of a 
sequence is one that is derived from a parental sequence those sequences having 

10 substitutions, additions, deletions, or chemically modified variants. Variants may also 
include fusion proteins, which refers to the fusion of one or more first sequences (such as a 
peptide) at the amino or carboxy terminus of at least one other sequence (such as a 
heterologous peptide). 

"Similarity" is a concept related to identity, except that similarity refers to a measure 

15 of relatedness which includes both identical matches and conservative substitution matches. 
If two polypeptide sequences have, for example, 10/20 identical amino acids, and the 
remainder are all non -conservative substitutions, then the percent identity and similarity 
would both be 50%. If in the same example, there are five more positions where there are 
conservative substitutions, then the percent identity remains 50%, but the percent similarity 

20 would be 75% (15/20). Therefore, in cases where there are conservative substitutions, the 
percent similarity between two polypeptides will be higher than the percent identity between 
those two polypeptides. 

Substitutions may be conservative, or non-conservative, or any combination thereof. 
Conservative amino acid modifications to the sequence of a polypeptide (and the 

25 corresponding modifications to the encoding nucleotides) may produce polypeptides having 
functional and chemical characteristics similar to those of a parental polypeptide. For 
example, a "conservative amino acid substitution" may involve a substitution of a native 
amino acid residue with a non-native residue such that there is little or no effect on the size, 
polarity, charge, hydrophobicity, or hydrophilicity of the amino acid residue at that position 

30 and, in particlar, does not result in decreased immunogenicity. Suitable conservative amino 
acid substitutions are shown in Table I. 

Table 1 
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A skilled artisan will be able to determine suitable variants of polypeptide using well- 
known techniques. For identifying suitable areas of the molecule that may be changed 
without destroying biological activity (i.e., MHC binding, immunogenicity), one skilled in 

5 the art may target areas not believed to be important for that activity. For example, when 
similar polypeptides with similar activities from the same species or from other species are 
known, one skilled in the art may compare the amino acid sequence of a polypeptide to such 
similar polypeptides. By performing such analyses, one can identify residues and portions of 
the molecules that are conserved among similar polypeptides. It will be appreciated that 

10 changes in areas of the molecule that are not conserved relative to such similar polypeptides 
would be less likely to adversely affect the biological activity and/or structure of a 
polypeptide. Similarly, the residues required for binding to MHC are known, and may be 
modified to improve binding. However, modifications resulting in decreased binding to 
MHC will not be appropriate in most situations. One skilled in the art would also know that, 

15 even in relatively conserved regions, one may substitute chemically similar amino acids for 
the naturally occurring residues while retaining activity. Therefore, even areas that may be 
important for biological activity or for structure may be subject to conservative amino acid 
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substitutions without destroying the biological activity or without adversely affecting the 
polypeptide structure. 

Other preferred polypeptide variants include glycosylation variants wherein the 
number and/or type of glycosylation sites have been altered compared to the subject amino 

5 acid sequence. In one embodiment, polypeptide variants comprise a greater or a lesser 
number of N-linked glycosylation sites than the subject amino acid sequence. An N-linked 
glycosylation site is characterized by the sequence Asn-X-Ser or Asn-X-Thr, wherein the 
amino acid residue designated as X may be any amino acid residue except proline. The 
substitution of amino acid residues to create this sequence provides a potential new site for 

10 the addition of an N-linked carbohydrate chain. Alternatively, substitutions that eliminate 
this sequence will remove an existing N-linked carbohydrate chain. Also provided is a 
rearrangement of N-linked carbohydrate chains wherein one or more N-linked glycosylation 
sites (typically those that are naturally occurring) are eliminated and one or more new N- 
linked sites are created. To affect O-linked glycosylation of a polypeptide, one would modify 

1 5 serine and / or threonine residues. 

Additional preferred variants include cysteine variants, wherein one or more cysteine 
residues are deleted or substituted with another amino acid (e.g., serine) as compared to the 
subject amino acid sequence set. Cysteine variants are useful when polypeptides must be 
refolded into a biologically active conformation such as after the isolation of insoluble 

20 inclusion bodies. Cysteine variants generally have fewer cysteine residues than the native 
protein, and typically have an even number to minimize interactions resulting from unpaired 
cysteines. 

In other embodiments, the isolated polypeptides of the current invention include 
fusion polypeptide segments that assist in purification of the polypeptides. Fusions can be 

25 made either at the amino terminus or at the carboxy terminus of the subject polypeptide 
variant thereof. Fusions may be direct with no linker or adapter molecule or may be through 
a linker or adapter molecule. A linker or adapter molecule may be one or more amino acid 
residues, typically from about 20 to about 50 amino acid residues. A linker or adapter 
molecule may also be designed with a cleavage site for a DNA restriction endonuclease or for 

30 a protease to allow for the separation of the fused moieties. It will be appreciated that once 
constructed, the fusion polypeptides can be derivatized according to the methods described 
herein. Suitable fusion segments include, among others, metal binding domains (e.g., a 
poly-histidine segment), immunoglobulin binding domains (i.e., Protein A, Protein G, T cell, 



B cell, Fc receptor, or complement protein antibody-binding domains), sugar binding 
domains (e.g., a maltose binding domain), and/or a "tag" domain (i.e., at least a portion of 
ot-galactosidase, a strep tag peptide, a T7 tag peptide, a FLAG peptide, or other domains that 
can be purified using compounds that bind to the domain, such as monoclonal antibodies). 

5 This tag is typically fused to the polypeptide upon expression of the polypeptide, and can 
serve as a means for affinity purification of the sequence of interest polypeptide from the host 
cell. Affinity purification can be accomplished, for example, by column chromatography 
using antibodies against the tag as an affinity matrix. Optionally, the tag can subsequently be 
removed from the purified sequence of interest polypeptide by various means such as using 

10 certain peptidases for cleavage. As described below, fusions may also be made between a TA 
and a co-stimulatory components such as the chemokines CXC10 (IP- 10), CCL7 (MCP-3), or 
CCL5 (RANTES), for example. 

A fusion motif may enhance transport of an immunogenic target to an MHC 
processing compartment, such as the endoplasmic reticulum. These sequences, referred to as 

15 tranduction or transcytosis sequences, include sequences derived from HIV tat (see Kim et al. 
1997 J. Immunol. 159:1666), Drosophila antennapedia (see Schutze-Redelmeier et al. 1996 J. 
Immunol. 157:650), or human period- 1 protein (hPERl; in particular, 
SRRHHCRSKAKRSRHH). 

In addition, the polypeptide or variant thereof may be fused to a homologous 

20 polypeptide to form a homodimer or to a heterologous polypeptide to form a heterodimer. 
Heterologous peptides and polypeptides include, but are not limited to: an epitope to allow 
for the detection and/or isolation of a fusion polypeptide; a transmembrane receptor protein 
or a portion thereof, such as an extracellular domain or a transmembrane and intracellular 
domain; a ligand or a portion thereof which binds to a transmembrane receptor protein; an 

25 enzyme or portion thereof which is catalytically active; a polypeptide or peptide which 
promotes oligomerization, such as a leucine zipper domain; a polypeptide or peptide which 
increases stability, such as an immunoglobulin constant region; and a polypeptide which has 
a therapeutic activity different from the polypeptide or variant thereof. 

In certain embodiments, it may be advantageous to combine a nucleic acid sequence 

30 encoding an immunogenic target, polypeptide, or derivative thereof with one or more co- 
stimulatory component(s) such as cell surface proteins, cytokines or chemokines in a 
composition of the present invention. The co-stimulatory component may be included in the 
composition as a polypeptide or as a nucleic acid encoding the polypeptide, for example. 

15 



Suitable co-stimulatory molecules include, for instance, polypeptides that bind members of 
the CD28 family (i.e., CD28, ICOS; HutlofT, et al. Nature 1999, 397: 263-265; Peach, et al. 
J Exp Med 1994, 180: 2049-2058) such as the CD28 binding polypeptides B7.1 (CD80; 
Schwartz, 1992; Chen et al, 1992; Ellis, et al. J. Immunol., 156(8): 2700-9) and B7.2 (CD86; 

5 Ellis, et al. J. Immunol, 156(8): 2700-9); polypeptides which bind members of the integrin 
family (i.e., LFA-1 (CDlla / CD18); Sedwick, et al. J Immunol 1999, 162: 1367-1375; 
Waiting, et al. Science 1998, 282: 2266-2269; Lub, et al. Immunol Today 1995, 16: 479- 
483) including members of the ICAM family (i.e., ICAM-1, -2 or -3); polypeptides which 
bind CD2 family members (i.e., CD2, signalling lymphocyte activation molecule (CDwl50 

10 or "SLAM"; Aversa, et al. 

J Immunol 1997, 158: 4036^044)) such as CD58 (LFA-3; CD2 ligand; Davis, et al. 
Immunol Today 1996, 17: 177-187) or SLAM ligands (Sayos, et al. Nature 1998, 395: 462- 
469); polypeptides which bind heat stable antigen (HSA or CD24; Zhou, et al. Eur J 
Immunol 1997, 27: 2524-2528); polypeptides which bind to members of the TNF receptor 

15 (TNFR) family (i.e., 4-1 BB (CD137; Vinay, et al. Semin Immunol 1998, 10: 481^89), 
OX40 (CD134; Weinberg, et al. Semin Immunol 1998, 10: 471^80; Higgins, et al. J 
Immunol 1999, 162: 486-493), and CD27 (Lens, et al. Semin Immunol 1998, 10: 491-499)) 
such as 4-1 BBL (4- IBB ligand; Vinay, et al. Semin Immunol 1998, 10: 481^48; 
DeBenedette, et al. J Immunol 1997, 158: 551-559), TNFR associated factor-1 (TRAF-1; 4- 

20 IBB ligand; Saoulli, et al. J Exp Med 1998, 187: 1849-1862, Arch, et al. Mol Cell Biol 
1998, 18: 558-565), TRAF-2 (4-1 BB and OX40 ligand; Saoulli, et al. J Exp Med 1998, 187: 
1849-1862; Oshima, et al. Int Immunol 1998, 10: 517-526, Kawamata, et al. J Biol Chem 
1998, 273: 5808-5814), TRAF-3 (4-1 BB and OX40 ligand; Arch, et al. Mol Cell Biol 1998, 
18: 558-565; Jang, et al. Biochem Biophys Res Commun 1998, 242: 613-620; Kawamata S, 

25 et al. J Biol Chem 1998, 273: 5808-5814), OX40L (OX40 ligand; Gramaglia, et al. J 
Immunol 1998, 161: 6510-6517), TRAF-5 (OX40 ligand; Arch, et al. Mol Cell Biol 1998, 
18: 558-565; Kawamata, et al. J Biol Chem 1998, 273: 5808-5814), and CD70 (CD27 
ligand; Couderc, et al. Cancer Gene Then, 5(3): 163-75). CD154 (CD40 ligand or 
"CD40L"; Gurunathan, et al. J. Immunol., 1998, 161: 4563-4571; Sine, et al. Hum. Gene 

30 77?er.,2001, 12: 1091-1 102) may also be suitable. 

One or more cytokines may also be suitable co-stimulatory components or 
"adjuvants", either as polypeptides or being encoded by nucleic acids contained within the 
compositions of the present invention (Parmiani, et al. Immunol Lett 2000 Sep 15; 74(1): 41- 

16 



4; Berzofsky, et al. Nature Immunol. 1: 209-219). Suitable cytokines include, for example, 
interleukin-2 (1L-2) (Rosenberg, et al. Nature Med. 4: 321-327 (1998)), IL-4, IL-7, IL-12 
(reviewed by Pardoll, 1992; Harries, et al. J. Gene Med. 2000 Jul-Aug;2(4):243-9; Rao, et al. 
J. Immunol. 156: 3357-3365 (1996)), IL-15 (Xin, et al. Vaccine, 17:858-866, 1999), 1L-16 

5 (Cruikshank, et al. J. Leuk Biol. 67(6): 757-66, 2000), 1L-18 (J. Cancer Res. Clin. Oncol 
2001. 127(12): 718-726), GM-CSF (CSF (Disis, et al. Blood, 88: 202-210 (1996)), tumor 
necrosis factor-alpha (TNF-a), or interferon-gamma (INF-y). Other cytokines may also be 
suitable for practicing the present invention, as is known in the art. 

Chemokines may also be utilized. For example, fusion proteins comprising CXCL10 

10 (1P-10) and CCL7 (MCP-3) fused to a tumor self-antigen have been shown to induce anti- 
tumor immunity (Biragyn, et al. Nature Biotech. 1999, 17: 253-258). The chemokines 
CCL3 (MlP-lot) and CCL5 (RANTES) (Boyer, et al. Vaccine, 1999, 17 (Supp. 2): S53-S64) 
may also be of use in practicing the present invention. Other suitable chemokines are known 
in the art. 

15 It is also known in the art that suppressive or negative regulatory immune 

mechanisms may be blocked, resulting in enhanced immune responses. For instance, 
treatment with anti-CTLA-4 (Shrikant, et al. Immunity, 1996, 14: 145-155; Sutmuller, et al. 
J. Exp. Med., 2001, 194: 823-832), anti-CD25 (Sutmuller, supra), anti-CD4 (Matsui, et al. J. 
Immunol, 1999, 163: 184-193), the fusion protein IL13Ra2-Fc (Terabe, et al. Nature 

20 Immunol, 2000, I: 515-520), and combinations thereof (i.e., anti-CTLA-4 and anti-CD25, 
Sutmuller, supra) have been shown to upregulate anti-tumor immune responses and would be 
suitable in practicing the present invention. 

Any of these components may be used alone or in combination with other agents. For 
instance, it has been shown that a combination of CD80, ICAM-1 and LFA-3 ("TRICOM") 

25 may potentiate anti-cancer immune responses (Hodge, et al. Cancer Res. 59: 5800-5807 
(1999). Other effective combinations include, for example, IL-12 + GM-CSF (Ahlers, et al. 
J. Immunol, 158: 3947-3958 (1997); Iwasaki, et al. J. Immunol 158: 4591-4601 (1997)), IL- 
12 + GM-CSF + TNF-a (Ahlers, et al. Int. Immunol 13: 897-908 (2001)), CD80 + IL-12 
(Fruend, et al. Int. J. Cancer, 85: 508-517 (2000); Rao, et al. supra), and CD86 + GM-CSF + 

30 IL-12 (Iwasaki, supra). One of skill in the art would be aware of additional combinations 
useful in carrying out the present invention. In addition, the skilled artisan would be aware of 
additional reagents or methods that may be used to modulate such mechanisms. These 



17 



reagents and methods, as well as others known by those of skill in the art, may be utilized in 
practicing the present invention. 

Additional strategies for improving the efficiency of nucleic acid-based immunization 
may also be used including, for example, the use of self-replicating viral replicons (Caley, et 

5 al. 1999. Vaccine, 17: 3124-2135; Dubensky, et al. 2000. Mol Med. 6: 723-732; Leitner, et 
al. 2000. Cancer Res. 60:51-55), codon optimization (Liu, etal. 2000. Mol. Ther., 1: 497- 
500; Dubensky, supra; Huang, et al. 2001 . J. Virol 75: 4947-4951), in vivo electroporation 
(Widera, et al. 2000. J. Immunol 164: 4635-3640), incorporation of CpG stimulatory motifs 
(Gurunathan, et al. Ann. Rev. Immunol, 2000, 18: 927-974; Leitner, supra), sequences for 

10 targeting of the endocytic or ubiquitin-processing pathways (Thomson, et al. 1998. J. Virol 
72: 2246-2252; Velders, et al. 2001. J. Immunol 166: 5366-5373), prime-boost regimens 
(Gurunathan, supra; Sullivan, et al. 2000. Nature, 408: 605-609; Hanke, et al. 1998. 
Vaccine, 16: 439-445; Amara, et al. 2001. Science, 292: 69-74), and the use of mucosal 
delivery vectors such as Salmonella (Darji, et al. 1 997. Cell, 91 : 765-775; Woo, et al. 2001 . 

15 Vaccine, 19: 2945-2954). Other methods are known in the art, some of which are described 
below. 

Chemotherapeutic agents, radiation, anti -angiogenic compounds, or other agents may 
also be utilized in treating and / or preventing cancer using immunogenic targets (Sebti, et al. 
Oncogene 2000 Dec 27;19(56):6566-73). For example, in treating metastatic breast cancer, 

20 useful chemotherapeutic agents include cyclophosphamide, doxorubicin, paclitaxel, 
docetaxel, navelbine, capecitabine, and mitomycin C, among others. Combination 
chemotherapeutic regimens have also proven effective including cyclophosphamide + 
methotrexate + 5-fluorouracil; cyclophosphamide + doxorubicin + 5-fluorouracil; or, 
cyclophosphamide + doxorubicin, for example. Other compounds such as prednisone, a 

25 taxane, navelbine, mitomycin C, or vinblastine have been utlized for various reasons. A 
majority of breast cancer patients have estrogen -receptor positive (ER+) tumors and in these 
patients, endocrine therapy (i.e., tamoxifen) is preferred over chemotherapy. For such 
patients, tamoxifen or, as a second line therapy, progestins (medroxyprogesterone acetate or 
megestrol acetate) are preferred. Aromatase inhibitors (i.e., aminoglutethimide and analogs 

30 thereof such as letrozole) decrease the availability of estrogen needed to maintain tumor 
growth and may be used as second or third line endocrine therapy in certain patients. 

Other cancers may require different chemotherapeutic regimens. For example, 
metastatic colorectal cancer is typically treated with Camptosar (irinotecan or CPT-11), 5- 

18 



fluorouracil or leucovorin, alone or in combination with one another. Proteinase and integrin 
inhibitors such as as the MMP inhibitors marimastate (British Biotech), COL-3 (Collagenex), 
Neovastat (Aeterna), AG3340 (Agouron), BMS-275291 (Bristol Myers Squibb), CGS 
27023A (Novartis) or the integrin inhibitors Vitaxin (Medimmune), or MED1522 (Merck 
5 KgaA) may also be suitable for use. As such, immunological targeting of immunogenic 
targets associated with colorectal cancer could be performed in combination with a treatment 
using those chemotherapeutic agents. Similarly, chemotherapeutic agents used to treat other 
types of cancers are well-known in the art and may be combined with the immunogenic 
targets described herein. 

10 Many anti-angiogenic agents are known in the art and would be suitable for co- 

administration with the immunogenic target vaccines (see, for example, Timar, et al. 2001. 
Pathology Oncol. Res., 7(2): 85-94). Such agents include, for example, physiological agents 
such as growth factors (i.e., ANG-2, NK1,2,4 (HGF), transforming growth factor beta (TGF- 
(3)), cytokines (i.e., interferons such as IFN-ct, -p, -y, platelet factor 4 (PF-4), PR-39), 

15 proteases (i.e., cleaved AT-III, collagen XVIII fragment (Endostatin)), HmwKallikrein-d5 
plasmin fragment (Angiostatin), prothrombin-FI-2, TSP-1), protease inhibitors (i.e., tissue 
inhibitor of metal loproteases such as TIMP-1, -2, or -3; maspin; plasminogen activator- 
inhibitors such as PA1-1; pigment epithelium derived factor (PEDF)), Tumstatin (available 
through ILEX, Inc.), antibody products (i.e., the collagen-binding antibodies HUIV26, 

20 HUI77, XL313; anti-VEGF; anti-integrin (i.e., Vitaxin, (Lxsys))), and glycosidases (i.e., 
heparinase-I, -III). "Chemical" or modified physiological agents known or believed to have 
anti-angiogenic potential include, for example, vinblastine, taxol, ketoconazole, thalidomide, 
dolestatin, combrestatin A, rapamycin (Guba, et al. 2002, Nature Med, 8: 128-135), CEP- 
7055 (available from Cephalon, Inc.), flavone acetic acid, Bay 12-9566 (Bayer Corp.), 

25 AG3340 (Agouron, Inc.), CGS 27023A (Novartis), tetracylcine derivatives (i.e., COL-3 
(Collagenix, Inc.)), Neovastat (Aeterna), BMS-275291 (Bristol-Myers Squibb), low dose 5- 
FU, low dose methotrexate (MTX), irsofladine, radicicol, cyclosporine, captopril, celecoxib, 
D45152-sulphated polysaccharide, cationic protein (Protamine), cationic peptide-VEGF, 
Suramin (polysulphonated napthyl urea), compounds that interfere with the function or 

30 production of VEGF (i.e., SU5416 or SU6668 (Sugen), PTK787/ZK22584 (Novartis)), 
Distamycin A, Angiozyme (ribozyme), isoflavinoids, staurosporine derivatives, genistein, 
EMD121974 (Merck KcgaA), tyrphostins, isoquinolones, retinoic acid, 
carboxyamidotriazole, TNP-470, octreotide, 2-methoxyestradiol, aminosterols (i.e., 

19 



squalamine), glutathione analogues (i.e., N-acteyl-L-cysteine), combretastatin A-4 (Oxigene), 
Eph receptor blocking agents {Nature, 414:933-938, 2001), Rh-Angiostatin, Rh-Endostatin 
(WO 01/93897), cyclic-RGD peptide, accutin-disintegrin, benzodiazepenes, humanized anti- 
avb3 Ab, Rh-PAI-2, amiloride, p-amidobenzamidine, anti-uPA ab, anti-uPAR Ab, L- 
5 phanylalanin-N-methylamides (i.e., Batimistat, Marimastat), AG3340, and minocycline. 
Many other suitable agents are known in the art and would suffice in practicing the present 
invention. 

The present invention may also be utilized in combination with "non-traditional" 
methods of treating cancer. For example, it has recently been demonstrated that 

10 administration of certain anaerobic bacteria may assist in slowing tumor growth. In one 
study, Clostridium novyi was modified to eliminate a toxin gene carried on a phage episome 
and administered to mice with colorectal tumors (Dang, et al. P.N.A.S. USA, 98(26): 15155- 
15160, 2001). In combination with chemotherapy, the treatment was shown to cause tumor 
necrosis in the animals. The reagents and methodologies described in this application may be 

1 5 combined with such treatment methodologies. 

Nucleic acids encoding immunogenic targets may be administered to patients by any 
of several available techniques. Various viral vectors that have been successfully utilized for 
introducing a nucleic acid to a host include retrovirus, adenovirus, adeno-associated virus 
(AAV), herpes virus, and poxvirus, among others. It is understood in the art that many such 

20 viral vectors are available in the art. The vectors of the present invention may be constructed 
using standard recombinant techniques widely available to one skilled in the art. Such 
techniques may be found in common molecular biology references such as Molecular 
Cloning: A Laboratory Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory 
Press), Gene Expression Technology (Methods in Enzymology, Vol. 185, edited by D. 

25 Goeddel, 1991 . Academic Press, San Diego, CA), and PCR Protocols: A Guide to Methods 
and Applications (Innis, et al. 1990. Academic Press, San Diego, CA). 

Preferred retroviral vectors are derivatives of lentivirus as well as derivatives of 
murine or avian retroviruses. Examples of suitable retroviral vectors include, for example, 
Moloney murine leukemia virus (MoMuLV), Harvey murine sarcoma virus (HaMuSV), 

30 murine mammary tumor virus (MuMTV), SIV, BIV, HIV and Rous Sarcoma Virus (RSV). 
A number of retroviral vectors can incorporate multiple exogenous nucleic acid sequences. 
As recombinant retroviruses are defective, they require assistance in order to produce 
infectious vector particles. This assistance can be provided by, for example, helper cell lines 



encoding retrovirus structural genes. Suitable helper cell lines include ^2, PA317 and PA 12, 
among others. The vector virions produced using such cell lines may then be used to infect a 
tissue cell line, such as NIH 3T3 cells, to produce large quantities of chimeric retroviral 
virions. Retroviral vectors may be administered by traditional methods (i.e., injection) or by 

5 implantation of a "producer cell line" in proximity to the target cell population (Culver, K., et 
al, 1994, Hum. Gene Then, 5 (3): 343-79; Culver, K., etal, Cold Spring Harb. Symp. Quant. 
Biol, 59: 685-90); Oldfield, E., 1993, Hum. Gene Ther., 4 (1): 39-69). The producer cell 
line is engineered to produce a viral vector and releases viral particles in the vicinity of the 
target cell. A portion of the released viral particles contact the target cells and infect those 

10 cells, thus delivering a nucleic acid of the present invention to the target cell. Following 
infection of the target cell, expression of the nucleic acid of the vector occurs. 

Adenoviral vectors have proven especially useful for gene transfer into eukaryotic 
cells (Rosenfeld, M., et al, 1991 , Science, 252 (5004): 431-4; Crystal, R., et al., 1994, Nat. 
Genet., 8 (1): 42-51), the study eukaryotic gene expression (Levrero, M, et al., 1991, Gene, 

15 101 (2): 195-202), vaccine development (Graham, F. and Prevec, L., 1992, Biotechnology, 
20: 363-90), and in animal models (Stratford-Perricaudet, L., et al., 1992, Bone Marrow 
Transplant., 9 (Suppl. 1): 151-2 ; Rich, D., et al, 1993, Hum. Gene Ther., 4 (4): 461-76). 
Experimental routes for administrating recombinant Ad to different tissues in vivo have 
included intratracheal instillation (Rosenfeld, M., et al., 1992, Cell, 68 (1): 143-55) injection 

20 into muscle (Quantin, B., et al., 1992, Proc. Natl. Acad Sci. U.S.A., 89 (7): 2581-4), 
peripheral intravenous injection (Herz, J., and Gerard, R., 1993, Proc. Natl. Acad. Sci. U.S.A., 
90 (7): 2812-6) and stereotactic inoculation to brain (Le Gal La Salle, G., et al, 1993, 
Science, 259 (5097): 988-90), among others. 

Adeno-associated virus (AAV) demonstrates high-level infectivity, broad host range 

25 and specificity in integrating into the host cell genome (Hermonat, P., et al., 1984, Proc. Natl. 
Acad. Sci. U.S.A., 81 (20): 6466-70). And Herpes Simplex Virus type-1 (HSV-1) is yet 
another attractive vector system, especially for use in the nervous system because of its 
neurotropic property (Geller, A., et al, 1991, Trends NeuroscL, 14 (10): 428-32; Glorioso, et 
al, 1995, Mol. Biotechnol, 4 (1): 87-99; Glorioso, et al, 1995, Annu. Rev. Microbiol., 49: 

30 675-710). 

Poxvirus is another useful expression vector (Smith, et al. 1983, Gene, 25 (1): 21-8; 
Moss, et al, 1992, Biotechnology, 20: 345-62; Moss, et al, 1992, Curr. Top. Microbiol. 
Immunol, 158: 25-38; Moss, et al. 1991. Science, 252: 1662-1667). Poxviruses shown to be 
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useful include vaccinia, NYVAC, avipox, fowlpox, canarypox, ALVAC, and ALVAC(2), 
among others. 

NYVAC (vP866) was derived from the Copenhagen vaccine strain of vaccinia virus 
by deleting six nonessential regions of the genome encoding known or potential virulence 

5 factors (see, for example, U.S. Pat. Nos. 5,364,773 and 5,494,807). The deletion loci were 
also engineered as recipient loci for the insertion of foreign genes. The deleted regions are: 
thymidine kinase gene (TK; J2R); hemorrhagic region (u; B13R+B14R); A type inclusion 
body region (ATI; A26L); hemagglutinin gene (HA; A56R); host range gene region (C7L- 
K1L); and, large subunit, ribonucleotide reductase (I4L). NYVAC is a genetically 

10 engineered vaccinia virus strain that was generated by the specific deletion of eighteen open 
reading frames encoding gene products associated with virulence and host range. NYVAC 
has been show to be useful for expressing TAs (see, for example, U.S. Pat. No. 6,265,189). 
NYVAC (vP866), vP994, vCP205, vCP1433, placZH6H4Lreverse, pMPC6H6K3E3 and 
pC3H6FHVB were also deposited with the ATCC under the terms of the Budapest Treaty, 

15 accession numbers VR-2559, VR-2558, VR-2557, VR-2556, ATCC-97913, ATCC-97912, 
and ATCC-97914, respectively. 

ALVAC-based recombinant viruses (i.e., ALVAC- 1 and ALVAC-2) are also suitable 
for use in practicing the present invention (see, for example, U.S. Pat. No. 5,756,103). 
ALVAC(2) is identical to ALVAC(l) except that ALVAC(2) genome comprises the vaccinia 

20 E3L and K3L genes under the control of vaccinia promoters (U.S. Pat. No. 6,130,066; Beattie 
et al., 1995a, 1995b, 1991; Chang et al., 1992; Davies et al., 1993). Both ALVAC(l) and 
ALVAC(2) have been demonstrated to be useful in expressing foreign DNA sequences, such 
as TAs (Tartaglia et al., 1993 a,b; U.S. Pat. No. 5,833,975). ALVAC was deposited under the 
terms of the Budapest Treaty with the American Type Culture Collection (ATCC), 10801 

25 University Boulevard, Manassas, Va. 201 10-2209, USA, ATCC accession number VR-2547. 

Another useful poxvirus vector is TROVAC. TROVAC refers to an attenuated 
fowlpox that was a plaque-cloned isolate derived from the FP-1 vaccine strain of 
fowlpoxvirus which is licensed for vaccination of 1 day old chicks. TROVAC was likewise 
deposited under the terms of the Budapest Treaty with the ATCC, accession number 2553. 

30 "Non-viral" plasmid vectors may also be suitable in practicing the present invention. 

Preferred plasmid vectors are compatible with bacterial, insect, and / or mammalian host 
cells. Such vectors include, for example, PCR-II, pCR3, and pcDNA3.1 (Invitrogen, San 
Diego, CA), pBSII (Stratagene, La Jolla, CA), pET15 (Novagen, Madison, WI), pGEX 
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(Pharmacia Biotech, Piscataway, NJ), pEGFP-N2 (Clontech, Palo Alto, CA), pETL 
(BlueBacIl, Invitrogen), pDSR-alpha (PCT pub. No. WO 90/14363) and pFastBacDual 
(Gibco-BRL, Grand Island, NY) as well as Bluescript plasmid derivatives (a high copy 
number COLE 1 -based phagemid, Stratagene Cloning Systems, La Jolla, CA), PCR cloning 

5 plasmids designed for cloning Taq-amplified PCR products (e.g., TOPO™ TA cloning® kit, 
PCR2.1 plasmid derivatives, Invitrogen, Carlsbad, CA). Bacterial vectors may also be used 
with the current invention. These vectors include, for example, Shigella, Salmonella, Vibrio 
cholerae, Lactobacillus, Bacille calmette guerin (BCG), and Streptococcus (see for example, 
WO 88/6626; WO 90/0594; WO 91/13157; WO 92/1796; and WO 92/21376). Many other 

10 non-viral plasmid expression vectors and systems are known in the art and could be used 
with the current invention. 

Suitable nucleic acid delivery techniques include DNA-ligand complexes, adenovirus- 
ligand-DN A complexes, direct injection of DNA, CaP0 4 precipitation, gene gun techniques, 
electroporation, and colloidal dispersion systems, among others. Colloidal dispersion systems 

15 include macromolecule complexes, nanocapsules, microspheres, beads, and lipid-based 
systems including oil-in-water emulsions, micelles, mixed micelles, and liposomes. The 
preferred colloidal system of this invention is a liposome, which are artificial membrane 
vesicles useful as delivery vehicles in vitro and in vivo. RNA, DNA and intact virions can be 
encapsulated within the aqueous interior and be delivered to cells in a biologically active 

20 form (Fraley, R., et aL, 1 98 1 , Trends Biochem. Sci., 6: 77). The composition of the liposome 
is usually a combination of phospholipids, particularly high-phase-transition-temperature 
phospholipids, usually in combination with steroids, especially cholesterol. Other 
phospholipids or other lipids may also be used. The physical characteristics of liposomes 
depend on pH, ionic strength, and the presence of divalent cations. Examples of lipids useful 

25 in liposome production include phosphatidyl compounds, such as phosphatidylglycerol, 
phosphatidylcholine, phosphatidyl serine, phosphatidylethanolamine, sphingolipids, 
cerebrosides, and gangliosides. Particularly useful are diacy I phosphatidyl glycerols, where 
the lipid moiety contains from 14-18 carbon atoms, particularly from 16-18 carbon atoms, 
and is saturated. Illustrative phospholipids include egg phosphatidylcholine, 

30 dipalmitoylphosphatidylcholine and distearoylphosphatidylcholine. 

An immunogenic target may also be administered in combination with one or more 
adjuvants to boost the immune response. Exemplary adjuvants are shown in Table I below: 
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Table I 

Types of Immunologic Adjuvants 





Type of 
Adjuvant 


General Examples 


Specific Examples/References 


1 


Gel-type 


Aluminum hydroxide/phosphate ("alum 
adjuvants") 


(Aggerbeck and Heron, 1995) 






Calcium phosphate 


(Relyveld, 1986) 


2 


Microbial 


Muramyl dipeptide (MDP) 


(Chedid et al., 1986) 






Bacterial exotoxins 


Cholera toxin (CT), E.coli labile toxin 
(LT)(Frey tag and Clements, 1999) 






Endotox in-based adjuvants 


Monophosphoryl lipid A (MPL) 
(Ulrich and Myers, 1995) 






Other bacterial 


CpG oligonucleotides (Corral and 
Petray, 2000), BCG sequences (Krieg, et 
al. Nature, 374:576), tetanus toxoid 
(Rice,etal. J. Immunol. ,2001, 167: 
1558-1565) 


3 


Particulate 


Biodegradable 
polymer microspheres 


(Gupta etal., 1998) 






Immunostimulatory complexes 
(ISCOMs) 


(Morein and Bengtsson, 1999) 






Liposomes 


(Wassefetal., 1994) 


4 


Oil-emulsion 
and 

surfactant- 
based 
adjuvants 


Freund's incomplete adjuvant 


(Jensen etal., 1998) 












Microfluidized emulsions 


MF59 (Ott et al., 1995) 


SAF (Allison and Byars, 1992) 
(Allison, 1999) 






Saponins 


QS-21 (Kensil, 1996) 


5 


Synthetic 


Muramyl peptide derivatives 


Murabutide (Lederer, 1986) 
Threony-MDP (Allison, 1997) 






Nonionic block copolymers 


L121 (Allison, 1999) 






Polyphosphazene (PCPP) 


(Payne et al., 1995) 






Synthetic polynucleotides 


Poly A:U, Poly I:C (Johnson, 1994) 



5 The immunogenic targets of the present invention may also be used to generate 

antibodies for use in screening assays or for immunotherapy. Other uses would be apparent 
to one of skill in the art. The term "antibody" includes antibody fragments, as are known in 
the art, including Fab, Fab 2 , single chain antibodies (Fv for example), humanized antibodies, 
chimeric antibodies, human antibodies, produced by several methods as are known in the art. 

10 Methods of preparing and utilizing various types of antibodies are well-known to those of 

skill in the art and would be suitable in practicing the present invention (see, for example, 

Harlow, et al. Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988; 

Harlow, et al. Using Antibodies: A Laboratory Manual, Portable Protocol No. /, 1998; 

Kohler and Milstein, Nature, 256:495 (1975)); Jones et al. Nature, 321:522-525 (1986); 
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Riechmann et a!. Nature, 332:323-329 (1988); Presta (Curr. Op. Struct. Biol., 2:593-596 
(1992); Verhoeyen et al. (Science, 239:1534-1536 (1988); Hoogenboom et al., J. Mol. Biol., 
227:381 (1991); Marks et al., J. Mol. Biol., 222:581 (1991); Cole et al., Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985); Boerner et al., J. Immunol., 

5 147(l):86-95 (1991); Marks et al., Bio/Technology 10, 779-783 (1992); Lonberg et al., 
Nature 368 856-859 (1994); Morrison, Nature 368 812-13 (1994); Fishwild et al., Nature 
Biotechnology 14, 845-51 (1996); Neuberger, Nature Biotechnology 14, 826 (1996); Lonberg 
and Huszar, Intern. Rev. Immunol. 13 65-93 (1995); as well as U.S. Pat. Nos. 4,816,567; 
5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; and, 5,661,016). The antibodies or 

10 derivatives therefrom may also be conjugated to therapeutic moieties such as cytotoxic drugs 
or toxins, or active fragments thereof such as diptheria A chain, exotoxin A chain, ricin A 
chain, abrin A chain, curcin, crotin, phenomycin, enomycin, among others. Cytotoxic agents 
may also include radiochemicals. Antibodies and their derivatives may be incorporated into 
compositions of the invention for use in vitro or in vivo. 

15 Nucleic acids, proteins, or derivatives thereof representing an immunogenic target 

may be used in assays to determine the presence of a disease state in a patient, to predict 
prognosis, or to determine the effectiveness of a chemotherapeutic or other treatment 
regimen. Expression profiles, performed as is known in the art, may be used to determine the 
relative level of expression of the immunogenic target. The level of expression may then be 

20 correlated with base levels to determine whether a particular disease is present within the 
patient, the patient's prognosis, or whether a particular treatment regimen is effective. For 
example, if the patient is being treated with a particular chemotherapeutic regimen, an 
decreased level of expression of an immunogenic target in the patient's tissues (i.e., in 
peripheral blood) may indicate the regimen is decreasing the cancer load in that host. 

25 Similarly, if the level of expresssion is increasing, another therapeutic modality may need to 
be utilized. In one embodiment, nucleic acid probes corresponding to a nucleic acid encoding 
an immunogenic target may be attached to a biochip, as is known in the art, for the detection 
and quantification of expression in the host. 

It is also possible to use nucleic acids, proteins, derivatives therefrom, or antibodies 

30 thereto as reagents in drug screening assays. The reagents may be used to ascertain the effect 
of a drug candidate on the expression of the immunogenic target in a cell line, or a cell or 
tissue of a patient. The expression profiling technique may be combined with high 
throughput screening techniques to allow rapid identification of useful compounds and 



monitor the effectiveness of treatment with a drug candidate (see, for example, Zlokarnik, et 
al., Science 279, 84-8 (1998)). Drug candidates may be chemical compounds, nucleic acids, 
proteins, antibodies, or derivatives therefrom, whether naturally occurring or synthetically 
derived. Drug candidates thus identified may be utilized, among other uses, as 
5 pharmaceutical compositions for administration to patients or for use in further screening 
assays. 

Administration of a composition of the present invention to a host may be 
accomplished using any of a variety of techniques known to those of skill in the art. The 
composition(s) may be processed in accordance with conventional methods of pharmacy to 

10 produce medicinal agents for administration to patients, including humans and other 
mammals (i.e., a "pharmaceutical composition"). The pharmaceutical composition is 
preferably made in the form of a dosage unit containing a given amount of DNA, viral vector 
particles, polypeptide or peptide, for example. A suitable daily dose for a human or other 
mammal may vary widely depending on the condition of the patient and other factors, but, 

1 5 once again, can be determined using routine methods. 

The pharmaceutical composition may be administered orally, parentally, by inhalation 
spray, rectally, intranodally, or topically in dosage unit formulations containing conventional 
pharmaceutical ly acceptable carriers, adjuvants, and vehicles. The term "pharmaceutical I y 
acceptable carrier" or "physiologically acceptable carrier" as used herein refers to one or 

20 more formulation materials suitable for accomplishing or enhancing the delivery of a nucleic 
acid, polypeptide, or peptide as a pharmaceutical composition. A "pharmaceutical 
composition" is a composition comprising a therapeutically effective amount of a nucleic 
acid or polypeptide. The terms "effective amount" and "therapeutically effective amount" 
each refer to the amount of a nucleic acid or polypeptide used to induce or enhance an 

25 effective immune response. It is preferred that compositions of the present invention provide 
for the induction or enhancement of an anti-tumor immune response in a host which protects 
the host from the development of a tumor and / or allows the host to eliminate an existing 
tumor from the body. 

For oral administration, the pharmaceutical composition may be of any of several 
30 forms including, for example, a capsule, a tablet, a suspension, or liquid, among others. 
Liquids may be administered by injection as a composition with suitable carriers including 
saline, dextrose, or water. The term parenteral as used herein includes subcutaneous, 
intravenous, intramuscular, intrasternal, infusion, or intraperitoneal administration. 



Suppositories for rectal administration of the drug can be prepared by mixing the drug with a 
suitable non-irritating excipient such as cocoa butter and polyethylene glycols that are solid at 
ordinary temperatures but liquid at the rectal temperature. 

The dosage regimen for immunizing a host or otherwise treating a disorder or a 

5 disease with a composition of this invention is based on a variety of factors, including the 
type of disease, the age, weight, sex, medical condition of the patient, the severity of the 
condition, the route of administration, and the particular compound employed. For example, 
a poxviral vector may be administered as a composition comprising 1 x 10 6 infectious 
particles per dose. Thus, the dosage regimen may vary widely, but can be determined 

10 routinely using standard methods. 

A prime-boost regimen may also be utilized (WO 01/30382 Al) in which the targeted 
immunogen is initially administered in a priming step in one form followed by a boosting 
step in which the targeted immunogen is administered in another form. The form of the 
targeted immunogen in the priming and boosting steps are different. For instance, if the 

15 priming step utilized a nucleic acid, the boost may be administered as a peptide. Simmilarly, 
where a priming step utilized one type of recombinant virus (i.e., ALVAC), the boost step 
may utilize another type of virus (i.e., NYVAC). This prime-boost method of administration 
has been shown to induce strong immunological responses. 

While the compositions of the invention can be administered as the sole active 

20 pharmaceutical agent, they can also be used in combination with one or more other 
compositions or agents (i.e., other immunogenic targets, co-stimulatory molecules, 
adjuvants). When administered as a combination, the individual components can be 
formulated as separate compositions administered at the same time or different times, or the 
components can be combined as a single composition. 

25 Injectable preparations, such as sterile injectable aqueous or oleaginous suspensions, 

may be formulated according to known methods using suitable dispersing or wetting agents 
and suspending agents. The injectable preparation may also be a sterile injectable solution or 
suspension in a non-toxic parenterally acceptable diluent or solvent. Suitable vehicles and 
solvents that may be employed are water, Ringer's solution, and isotonic sodium chloride 

30 solution, among others. For instance, a viral vector such as a poxvirus may be prepared in 
0.4% NaCI. In addition, sterile, fixed oils are conventionally employed as a solvent or 
suspending medium. For this purpose, any bland fixed oil may be employed, including 
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synthetic mono- or diglycerides. In addition, fatty acids such as oleic acid find use in the 
preparation of injectables. 

For topical administration, a suitable topical dose of a composition may be 
administered one to four, and preferably two or three times daily. The dose may also be 

5 administered with intervening days during which no does is applied. Suitable compositions 
may comprise from 0.001% to 10% w/w, for example, from 1% to 2% by weight of the 
formulation, although it may comprise as much as 10% w/w, but preferably not more than 
5% w/w, and more preferably from 0.1% to 1% of the formulation. Formulations suitable for 
topical administration include liquid or semi-liquid preparations suitable for penetration 

10 through the skin (e.g., liniments, lotions, ointments, creams, or pastes) and drops suitable for 
administration to the eye, ear, or nose. 

The pharmaceutical compositions may also be prepared in a solid form (including 
granules, powders or suppositories). The pharmaceutical compositions may be subjected to 
conventional pharmaceutical operations such as sterilization and/or may contain conventional 

15 adjuvants, such as preservatives, stabilizers, wetting agents, emulsifiers, buffers etc. Solid 
dosage forms for oral administration may include capsules, tablets, pills, powders, and 
granules. In such solid dosage forms, the active compound may be admixed with at least one 
inert diluent such as sucrose, lactose, or starch. Such dosage forms may also comprise, as in 
normal practice, additional substances other than inert diluents, e.g., lubricating agents such 

20 as magnesium stearate. In the case of capsules, tablets, and pills, the dosage forms may also 
comprise buffering agents. Tablets and pills can additionally be prepared with enteric 
coatings. Liquid dosage forms for oral administration may include pharmaceutical^ 
acceptable emulsions, solutions, suspensions, syrups, and elixirs containing inert diluents 
commonly used in the art, such as water. Such compositions may also comprise adjuvants, 

25 such as wetting sweetening, flavoring, and perfuming agents. 

Pharmaceutical compositions comprising a nucleic acid or polypeptide of the present 
invention may take any of several forms and may be administered by any of several routes. 
In preferred embodiments, the compositions are administered via a parenteral route 
(intradermal, intramuscular or subcutaneous) to induce an immune response in the host. 

30 Alternatively, the composition may be administered directly into a lymph node (intranodal) 
or tumor mass (i.e., intratumoral administration). For example, the dose could be 
administered subcutaneously at days 0, 7, and 14. Suitable methods for immunization using 
compositions comprising TAs are known in the art, as shown for p53 (Hollstein et al., 1991), 



p21-ras (Almoguera et al., 1988), HER-2 (Fendly et al., 1990), the melanoma-associated 
antigens (MAGE-1; MAGE-2) (van der Bruggen et ah, 1991), p97 (Hu et al., 1988), and 
carcinoembryonic antigen (CEA) (Kantor et al., 1993; Fishbein et al., 1992; Kaufman et al., 
1991), among others. 

5 Preferred embodiments of administratable compositions include, for example, nucleic 

acids or polypeptides in liquid preparations such as suspensions, syrups, or elixirs. Preferred 
injectable preparations include, for example, nucleic acids or polypeptides suitable for 
parental, subcutaneous, intradermal, intramuscular or intravenous administration such as 
sterile suspensions or emulsions. For example, a recombinant poxvirus may be in admixture 

10 with a suitable carrier, diluent, or excipient such as sterile water, physiological saline, glucose 
or the like. The composition may also be provided in lyophilized form for reconstituting, for 
instance, in isotonic aqueous, saline buffer. In addition, the compositions can be co- 
administered or sequentially administered with other antineoplastic, anti-tumor or anti-cancer 
agents and/or with agents which reduce or alleviate ill effects of antineoplastic, anti-tumor or 

1 5 anti-cancer agents. 

A kit comprising a composition of the present invention is also provided. The kit can 
include a separate container containing a suitable carrier, diluent or excipient. The kit can 
also include an additional anti-cancer, anti-tumor or antineoplastic agent and/or an agent that 
reduces or alleviates ill effects of antineoplastic, anti-tumor or anti-cancer agents for co- or 

20 sequential-administration. Additionally, the kit can include instructions for mixing or 
combining ingredients and/or administration. 

A better understanding of the present invention and of its many advantages will be 
had from the following examples, given by way of illustration. 

25 

EXAMPLES 
Example 1 

A. Modification of mCEA (6D) repeat 1 

The presence of truncated forms of CEA in cells following expression of recombinant 
30 CEA has been documented. This study set forth to generate CEA-encoding nucleic acid 
sequences that do not result in the expression of truncated CEA following expression in cells. 
Generation and expression of a new CEA-encoding nucleic acid sequence, CAP(6D)-1,2 
(SEP 1DN0:28) , is described below. 



The plasmid p3'H6MCEA was obtained from Virogenetics, Inc. This plasmid 
contains the MCEA gene with 6D modification under the control of partial H6 promoter (Fig. 
1A : SEP ID NO: 23) . The 912 bp Nrul - BamHl fragment from p3'H6MCEA was cloned 
into pUC18 to form plasmid pSEI544.9 (pUC18-mCEA repeat 1; Fig. IB). 

5 OPC purified Oligos 7524-7526, 7528-7533, 7535-7537, and 7567-7568 were kinased 

and annealed to create two fragments which were ligated to result in a 464 bp synthetic 
modified mCEA repeat 1 flanked by AccI and BamHl sites. This synthetic modified repeat 1 
fragment was cloned into pSE1544.9 AccI-BamHl to create pSE1616.44 (pUC18-mCEA- 
modified repeat 1 ; Fig. 2). The 904 bp EcoRV - BamHl fragment of pSE1616.44 was cloned 

10 back into p3'H6MCEA EcoRV-BamHI to form pSE1658.15 (p3'H6MCEA-modified repeat 
i ; Fig. 3). 



15 B. Modification of mCEA(6D) repeat 2 

A synthetic modified repeat 2 fragment was created by using a method called gene 
splicing by overlap extension (SOE) and cloned into pBluescript-SK+, generating pBSmCEA 
(Fig. 4). The oligos used for the repeat 2 modification are shown below (section IV, B). The 
two different clones (pBS-mCEA-3 and pBS-mCEA-8) contained various point mutations. 

20 The 697 bp BamHl - EcoRl fragment of pBS-mCEA-3 was cloned into pUC18 

BamHl - EcoRl to create pSE1671.8. The 591 bp Spel - Bsu36I fragment of pBS mCEA-8 
was cloned into pSE1671.8 SpeI-Bsu36I, generating plasmid designated pSE 168 1.1. Two site 
PCR mutagenesis, using the Quikchange site directed mutagenesis kit from Stratagene with 
oligos 7751 

25 (GGACGGTAGTAGGTGTATGATGGAGATATAGTTGGGTCGTCTGGGCC : SEP ID 
NP: 24) and 7760 (CAGAATGAATTATCCGTTGATCACTCC ; SEP ID NP: 25) , was 
performed to correct the two remaining point mutations pSE 1 68 1.1. The corrected clone was 
designated pSEl 686.1 (pUC18 mCEA modified repeat 2; Fig. 5). 

As noted recently, an Alanine codon was absent from 5' terminus of the second repeat 

30 in plasmid p3'H6MCEA which contained CEA. To preserve the consistency of the amino 
acid sequence of CEA, the Alanine codon present in plasmid pSEl 686.1 containing the 
modified second repeat of CEA was knocked out. This was accomplished using oligos 7802 
(CGTGACGACGATTACCGTGTATGAGCCACCAAAACCATTCATAACi__SEQ__ID_NOi 



26) and 7803 (GTTATGAATGGTTTTGGTGGCTCATACACGGTAATCGTCGTCACG; 
SEP ID NO: 27) and the Quikchange site-directed mutagenesis kit from Stratagene. The 
resulting plasmid, pSE 1696.1 (pUC18 mCEA modified repeat 2; Fig. 6) was confirmed by 
sequencing. 

5 The 694 bp Bsu361-BamHI fragment from pSE 1696.1 was cloned into Bsu361- 

BamHI site of Psel658.15 to combine modified repeats 1 and 2. The generated plasmid was 
designated p3'H6modMCEA-lst&2nd repeats (Fig. 7). 

C Construction of ALVAC donor plasmid pNVQH6MCEA(6Dlst&2nd) 

10 The 2.2 kb Nrul/Xhol fragment from p3 , H6modMCEA-lst&2ndrepeats was cloned 

into Nrul/Xhol site of pNVQH6LSP-18, generating pNVQH6MCEA(6Dlst&2 nd ; Fig. 8). 
The modified CEA sequence ("CAP(6D)-1,2") contained within pNVQH6MCEA is shown in 
Fig. 9 and SEP ID NO:28 . 

15 

D. Expression of modified CEA 

To test the stability of the CAP(6D)-1,2 sequence upon expression in a cell, the gene 
together with flanking H6 promoter was PCR amplified using 

20 pNVQH6MCEA(6D!ST&2ND) as template and two oligos (8034LZ: 
CTGGCGCGCCTTCTTTATTCTATACTTAAAAAGTG (SEP ID NP: 29) : and 8035LZ: 
CTGGTACCAGAAAAACTATATCAGAGCAACCCCAAC (SEP ID NP: 30) ). The PCR 
product was then cloned into an MY VAC TK donor plasmid designated pLZTKl containing 
the LacZ and K1L marker genes. This vector was specifically made for the generation of 

25 recombinant virus in NYVAC by using blue/white screening method. After in vitro 
recombination between donor plasmid pLZTKlmCEA(6Dlst&2nd) and NYVAC, the 
foreign CAP(6D)-1,2 sequence and marker genes are integrated into the NYVAC genome. 
The plaques containing intermediate recombinant NYVAC with both LacZ and mCEA 
appeared blue. Several rounds of plaque purification were then performed. The second 

30 recombination event kicked out the marker genes resulting in the final white plaques 
containing recombinants with only theCAP(6D)-l,2 sequence but no marker genes (Fig. 10). 

The recombinant white plaques and blue plaques were picked for confirmation of 
CAP(6D)-1,2 sequence expression. Infection was performed using the virus from the 
respective plaques and the cells were harvested three days after infection for preparing either 
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cellular DNA or cell lysate. For isolation of recombinant NYVAC DNA, DNAzol reagent . ' 
(GibcoBRL) was used. PCR (PCR Condition: 95°C (5min)-> [95°C(30sec) -> 49°C(30sec) 
-> 72°C(lmin)] 30 cycles->72°C (7min)-> 4°C) was run to confirm the existence of 
CAP(6D)-1 ,2 sequence in the recombinant NYVAC genome. The primers used were 7569LZ 
5 (5' ttggatccatggagtctccctcggcc 3' forward prime r: SEP ID NO: 3H and 7570LZ (5' 
ttggatccctatatcagagcaacccc 3' reverse prime r; SEP ID NO: 32) . which could amplify the full 
length 2 1 06 bp CAP(6D)-1,2. 

The final recombinant white plaques PRBC-III- 2, 3, 6, 8, 9, 10 all demonstrated the 

2.1 kb CAP(6D)-1,2 sequence band in PCR. PRBC-I1I-N1 was a blue plaque with both 
10 marker genes and CAP(6D)-I,2 sequence still in the viral genome and the CAP(6D)-1,2 

sequence band was also amplified in the PCR. The prominent PCR band amplified from vCP 
307 DNA (containing native CEA integrated into the ALVAC genome) was truncated CEA 
at 1.2 kb with a very faint full-length CEA band. The cell-only sample (no viral infection) 
was used as a negative control and the plasmid pLZTKl MCEA(6D1 ST&2ND) was a positive 
15 control used in the PCR reaction. The PCR results clearly showed the full-length CAP(6D)- 

1 .2 in the recombinant viral genome with no other truncated form of CEA visible. This result 
indicated that CAP(6D)-1,2 has increased stability relative to the native CEA in the ALVAC 
genome. 

Protein expression was also assayed by immunoblot to confirm the absence of 
20 truncated CEA protein in cells expressing CAP(6D)-1,2 (Fig. 11). For isolation of cell lysate, 
cells were first washed with PBS followed by the addition of Lysis Buffer (Reporter Gene 
Assay; Boehringer Mannheim) and shaking for 15 minutes. Cell lysate was spun down at 
13,000 rpm and the supernatant was collected for Western blot analysis. Samples were 
loaded onto a 10% polyacrylamide gel and run at 125 volts. The protein was then transferred 
25 to a PVDF filter membrane (Immobilon-P, Millipore). An HRP-linked mouse CEA 
monoclonal antibody (1:1000; Fitzgerald) was used to detect the expression of mCEA with 
the enhancement from a chemi luminescence reagent (DNA Thunder™; NEN™ Life Science 
Products). 

All six final CAP(6D)-1,2 recombinant white plaques (PRBC-III-2,3,6,8,9,10) and 
30 one intermediate blue plaque (pRBC-III-Nl) showed only one CEA band with no other 
truncated form (Fig. 11). In contrast, protein from vCP307 plaques (recombinant ALVAC 
expressing native CEA) showed a clear truncated CEA product at -60 kDa in addition to the 
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full length CEA. Prolonged exposure of the film verified the absence of any truncated CEA 
polypeptides in the CAP(6D)-1 ,2 recombinants. CEF was used as the negative control. 

In conclusion, the CAP(6D>1,2 recombinants were generated with the mCEA instead 
of the native CEA to prevent the expression of multiple versions of CEA. CAP(6D)-I,2 
5 expressed from recombinant NYVAC was proven effective in eliminating truncated version 
of CEA by both PCR and Western blot. 

While the present invention has been described in terms of the preferred 
10 embodiments, it is understood that variations and modifications will occur to those skilled in 
the art. Therefore, it is intended that the appended claims cover all such equivalent variations 
that come within the scope of the invention as claimed. 
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gggggggg LI S TINC 



CEQ ID NO. i If AAG2 - 1 nuclootido ocquonco 

AGCAGGACCGGGGCCTGTGTCGCTATGGGTTCCGCGGCCGCCCCGGAGCGAGCGCTGGGCTACGTCCGCGAGTTG 
5 AG 

TCGCCACTCCTCCGACGTGCTGGGCAACCTCAACGAGCTGCGCCTGCGGGGGATCGTCACTGACGTCACGGTGCT 
GG 

TTGGCGGGCAACCCCTCACAGCACACAAGGCAGTTCTCATCGCCTGCAGTGGCTTCTTCTxATTCAATTTTCCGGG 
GG 

10 CGTGCGGGAGTCGGGGTGGACGTGCTCTCTCTGGCCGGGGGTCCCGAAGCGAGAGGCTTCGCCCCTCTATTGGAC 

CATGTACACTTCGCGCCTGCGCCTCTCTCCAGCCACTGCACCAGCACTCCTAGCGGCCGCCACCTATTTGCAGAT 
GG 

AGCACGTGGTCCAGGCATGCCACCCCTTCATCCAGGCCAGCTATGAACCTCTGGGCATCTCCCTGCGCCCCCTGC 
15 AA 

GCAGAACCCCCAACACCCCCAACGGCCCCTCCACCAGGTAGTCCCAGGCGCTCCGAAGGACACCCAGACCCACCT 
AG 

TGAATCTCCAAGCTGCAGTCAAGGCCCCCCCAGTCCAGCCAGCCCTGACCCCAAGGCCTGCAACTGGAAAAAGTA 
GA 

20 AGTACATCGTGCTAAACTCTCAGGCCTCCCAAGCAGGGAGCCTGGTCGGGGAGAGAAGTTCTGGTCAACCTTGCC 
GG 

CAAGCCAGGCTCCCCAGTGGAGACGAGGCCTCCAGCAGCAGCAGCAGCAGCAGCACCAGCAGCAGTGAACAAGGA 
GG 

CATTCCTGGTCCCCAGAGCAGGCTCTCTCCAACTGCTGCCACTGTGCAGTTCAAATGTGCGGCTCCAGCCAGTAC 
25 GG 

CCTACCTCCTCACATCCCAGGCTCAAGACACCTCTGGATCACCCTCTGAACGGGCTCGTCCACTACCGGGACTGA 
A* 

TTTTCAGCTGCCAGAACTGTGACGCTCTGGCAGGGTGCTCATCGGGGGCTGGACTCCTTGGTTCCTGGGGACGAA 
GA 

30 CAAACCCTATAAGTGTCAGCTCTGCCCGTCTTCGTTCCGCTACAAGGGCAACCTTGCCAGTCACCGTACAGTGCA 
GA 

CACGGCAAAAGCCTTACCACTGCTCAATCTGCGGAGCCCCTTTTAACCCCCCAGCAAACCTGAAAACGCACAGCC 
GG 

ATCCATTCGGGAGAGAAGCCGTATAAGTGTGAGACGTGCGGCTCGCGCTTTGTACAGGTGCCACATCTGCGGGCG 
35 GA 

CCTGCTGATCCACACCCGGGAGAAGCCCTACCCTTGCCCTACCTGCGGAACCCCCTTCCGCCACCTGCAGACCCT 
GA 

AGAGCCACGTTCGCATCCACACCGCAGACAAGCCTTACCACTGCGACCCCTGTGGCCTGCATTTCCGGCACAAGA 
G=F 

40 CAACTCCCGCTGCATCTGCGCCAGAAACACGGAGCTGCTACCAACACCAAAGTGCACTACCACATTCTCGGCGGG 
GG 

CTAGCTGAGCGCAGGCCCAGGCCCCACTTGCTTCCTGCGGGTGGGAAACCTGCAGGCCCAGGCCTTGCTTCCCTA 
2G 

AGCCTTGGGCATAGGGGTGTGCCAGGCCACTTTGGTATCAGAAATTGCCACCCTCTTAATTTCTCACTGGGGAGA 
45 GG 

AGGGGTGCCAGATCCTGGCTAGATCTGCCTCTGTTTTGGTGGTCANACCGTCTTCCCCACAAGCCAGATTGTTTC 
¥G 

AGGAGAGAGCTAGCTAGGGGCTGGGAAAGGGGACAGATTGGAGTCCTGGTCTCCCTAAGGGAATAGCCCTCCACC 
TO 

50 TGGCCCCCATTGCATTCAGTTTATCTGTAAAATATAATTTATTGAGGCCTTTGGGTGGCACCGGGGCCTTCATTC 
GA 

TTGCATTTCCCACTCCCCTCTTCCACAAGTGTGATTAAAAGTGACCAGAAACACAGAAGGTGAGATCACAGCTCT 
GG 

TGGCAGAGATTACTAGCCCTTGGCTCTCTCGTTTGGCTTGGGTATTTTATATTATTTCTGTCATAACTTTTATCT 
55 ^ 

AGAATTGTTCTTTCTCCTGTTTGTTTGCTTGTTAGTTTGTTTAAAATGGAAAAAGGGGTTCTCTGTGTTCTGCCC 
G£ 

GTAATTCTAGGTCTGGAACCTTTATTTGTTCTAGCGCAGCTCTGGGAACATGCGGGATTGTGGAATTGGGTCAGG 
AA 
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CCCTCTCTCCTATTCTCCATCTTCTACGTTCTCTACCACTCTACAAATGCATACAGACATTTCTCTGTTCTTCAA 
GG 

GTGATAGGAACCATTATGTTGAGGCCAAAATGGAAGTAATAATAAATGGCTGCTGGAGGGTGTGGGTGTGGGGGA 

5 CTGTATCTGGATTCCGTATCACTCCAACTGGAGGCTGTGCGTGTGGGGGATTCTCTATCTGCATTCCGTATCACT 
GG 

AAGTGGAGGCTGGGAGGTTTTTGTGCAAGATGGTCCAGAATCTAAAATGTCCCATTAATGTGGTCACTTGGGTTT 
GG 

CTCTGCTGTATCCATCTATAGTGGTAGAGACCCACCAGGGCTCAAGTGGAGTGCATCATGCTCGCACGGGGGCCT 
10 G¥ 

TCTTAGTAGTGAGTTGATGGGTCGATGGGGGAGAGATCAGAGATTGCTTATCAGAGATGATGTGACGTTTTCTGA 

G^ 

CTGCCCAGTCTCTATGAATGTTATGGCCTAGGGAAGAATCATGAAACTCTTTAGCTTGATTAGATGGTAAACAGT 
G^ 

15 TAACCCATGCTTTACTACAGAGGCATATGGGTTTGAATGTTACCTGGGGTTCTCTCTATTGAGTTGAGCCGGTTC 

CCTTTAGTGGGTTTTGGACATCTTCTGGCAAGTGTCCAGATGCCAGAACCTTCTTTTCCTCTAGAACGGATGGTG 
G^ 

TGGTAACCTTAGGTTTTAAAAGCTGGGTCTGTCACCTGGTCTTCGGATCGGTGCATTGGTGTCTGGAACCAGTGA 
20 A* 

GCATTAGAACCTTCCATAGGAAAAGAAAACGGGCTGAGTTCCATTCTGGGTTTGCTGTAGTTTGGTTGGGATTAT 
¥G 

TTGGCATTACAGATGTAAAAGATTGAGTAGCCCATAGGCCAAAGGCGTGTTCTAGTTGACCAAGTTTGAAGTACG 
A* 

25 TAAGAGGTTGGTTGAGGGGTGCAGTTTCTGGTGTAGGCCAGGTAGGTAGAAAGTGAGGAACAGGGTTGCCTCTTG 
GG 

TGGGTGGAGTCTCTGAAATGTTAGAAGAAGCGCTGAACCCTTGATTGATAGTTCTGGCCCTTGTTGGCGTGGGGC 
ATCTGATTATGGGACGAGGGTAGAAAGTAAGAAGCACTTTTGAATTTGTCGGGTAGAACTTCAACAATAAGTCAG 

30 

GTAGTGGGTGTCGCCTGGGGACTAGTGAGAAAGCTACTGTTGTCCGTCTTCCGTCTTTCTGCCCATGCCCCCACT 
GG 

AGAATTAAAGAAGGAAGAAGGGAAGGCGGAGGAGTCTATAAGAAGGAATCATGATTTCTATTTAGCAGATTGGAT 
GG 

35 GCACGTGGAGAATGCCTGGGGGTAGAAATGTTAGATCTTGCAACATCAGATCCTTGGAATAAAGAAGCCTCTCTC 
GG 

C A AAA AAA AAA AAA AAA AAA AA 
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SSQ- ID NO.! 2/ AAC2 2 Opon reading fr ame 

ATCGGTTCCCCCGCCGCCCCGGAGCCAGCGCTGGGCTACGTCCGCGAGTTCACTCGCCACTCCTCCGACGTGCTGGGCAACCT 
CAACGAGCTGCGCCTGCGCGGGATCCTCACTGACGTCACGCTGCTGGTTGGCGGGCA^iCCCCTCAGAGCACACAAGGCAGTTC 
5 TCATCGCCTGCAGTGGCTTCTTCTATTCAATTTTCCGGGGCCGTGCGGGAGTCGGCGTGGACGTGCTCTCTCTGCCCGGGGGT 
CCCGAAGCGx^GAGGCTTCGCCCCTGTATTGGACTTCATCTACACTTCGCGCCTGCGCCTCTCTCCAGCCACTGCACCAGCAGT 
CCTAGCGGCCGCCACCTATTTGCAGATGGAGCACGTGGTCCAGGCATGCCACCGCTTCATCCAGGCCAGCTJVTGAACCTCTGG 
GCATCTCCCTGCGCCCCCTGGAAGCAGAACCCCCAACACCCCCAACGGCCCCTCCACCAGGTACTCCCAGGCGCTCCGAAGGA 
CACCCAGACCCACCTACTGAATCTCGAAGCTGCAGTCAAGGCCCCCCCAGTCCAGCCAGCCCTGACCCCAAGGCCTGCAACTG 

10 GAAAAAGTACAAGTACATCGTGGTAAACTCTCAGGCCTCCCAAGCAGGGAGCCTGGTCGGGGAGAGAAGTTCTGGTCAACCTT 
GCCCCCAAGCCAGGCTCCCCAGTGGAGACGAGGCCTCCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGTGAAGAAGGACCCATT 
CCTGGTCCCCAGAGCAGGCTCTCTCCAACTGCTGCCACTGTGCAGTTCAAATGTGGGGCTCCAGCCAGTACCCCCTACCTCCT 
CACATCCCACGCTCAAGACACCTCTGGATCACCCTCTGAACGGGCTCGTCCACTACCGGGAAGTGAATTTTTCAGCTGCCAGA 
ACTGTGAGGCTGTGGCAGGGTGGTCATGGGGGGTGCAGTCCTTGGTTCCTGGGGAGGAAGACAAACCCTATAAGTGTCAGGTG 

15 TGCCGGTCTTCGTTCCGCTACAAGGGCAACCTTGCCAGTCATCGTACAGTGCACACAGGGGAAAAGCCTTACCACTGCTCAAT 
CTGCGGAGCCCGTTTTAiACCGGCCAGCA^ACCTGAJiAACGCACAGCCGCATCCATTCGGGAGAGi^iCCCGTATAAGTGTGAGA 
CGTGCGGCTCGCGCTTTGTACAGGTGGCACATCTGCGGGCGCACGTGCTGATCCACACCGGGGAGAAGCCCTACCCTTGCCCT 
ACCTGCGGAACCCGCTTCCGCCACCTGCAGACCCTCAAGAGCCACCTTCGCATCCACACCGCAGAGAAGCCTTACCACTGCGA 
CCCCTGTGGCCTGCATTTCCGGCACAAGAGTCAACTGCGGCTGCATCTGCGCCAGAAACACGGACCTGCTACCAACACCAAAG 

20 TGCACTACCACATTCTCGGGGGGCCCTAG 

SEQ ID NO. ; 3; 7524 

ATACCCCCAACTCCCTAAGCCTTCTATTACCTCCAATAATAGTAAGCCTGTCGAAGACAAACATG 

25 SEQ ID NO. i <\i 152 S 

GCCTGTGTGCCCTAGAGTCGAAGTGAGCAACGGAAATAGAAGTGTGACCCTGTTTAACGTGACCAGGAAC 

SfiQ-ID NO. ! 5; 1528 

ACGTGCTTTACGGACCCGATGCTCCTACAATCAGCCCTCTAAACACAAGCTATAGATCAGGGGAAAATCT 

30 

SSQ- ID NO. ! 6/ 1533 

ACGTTAAACAGGGTCAGAGTTCTATTTCCGTTGCTGAGTTGGAGTCTAGGGGACACAGGCAGGGACTGGT 

SEQ- ID NO. ! 7/ 7535 
35 CTGATCTATAGCTTGTGTTTAGAGGGCTGATTGTAGGAGCATCGGGTCCGTAAAGCACGTTGAGAATCAC 

SSQ- ID NO. ! 8/ 7537 

GATCCACTATTGTTCACGGTAATATTGGGAATGAACAGTTCCTGGGTGGACTGTTGGAAAGTG 

40 SEQ ID NO. ; 9t 75 6-7 

GACACAGCAAGCTACAAATCCGAAACCCAAAATCCAGTCACCGCCAGGAGGTCTGATTCAGTGATTCTCA 

ID NO. ! 10/ 1568 

TGAATCAGACCTCCTGGCGCTCACTCGATTTTGCGTTTCGCATTTGTAGCTTGCTGTGTCGTTCCTGGTC 

45 

SE Q ID NO. ■ 11 ; 75 ^-6 

GATCCTACACGTGCCAAGCTCACAATAGCGACACCGGACTCAACCGCACAACCGTGACGACGATTACCGTGTATG 
CCCA 

50 GEQ ID NO. ! 12/ 7 -5S7 

CATCCTCAACTGGGTTAGAATTGTTACTAGTTATGAATCGTTTTGGTCGCTCGGCATACACGGTAATCGT 

SEQ- ID NO. ; 13; 7677 

TTGTAAGGGAGTTGAGGATGAGGACGCAGTTGCATTAAGTTGTGAGGCAGAGATTCAAAATACCAGTTATTTATG 
55 CTGGG 



GEQ ID NO. ; 14; 1618 

60 GTCTAATGATAACCGCACATTGACACTCCTCTCCGTTACTCGCAATGATGTAGGACCTTATGACTGTGGCATTCA 
GAATG 
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SSe- ID NO. ; 15 1 7619 

TTTGTATCGCCCACACGACCCAACTATATCTCCATCATACACCTACTACCGTCCCCGCCTGAACTTGACCCTTTC 
TTGCC 

5 GEQ ID NO. i 16/ 7680 

TGATGGAAACATTCAGCAGCATACTCAAGAGTTATTTATAAGCAACATAACTGAGAAGAACAGCGGACTCTATAC 
TTGCC 

CEQ ID NO. ; 17; 7 6£j- 
10 TAAAACAATAAGTGTTTCCGCCGAGGTCCCCAAGCCCTCCATCTCCACCAACAAGTCCAAAGCCGTGGAGGACAA 
GGATG 

SSQ- ID NO. ! 1 8 ; 7ggJ? 

ATGTGCGGTTATCATTAGACAACTGCAAGCGTGGGCTAACCGGCAAACTTTGGTTATTGACCCACCATAAATAAG 
15 TCGTA 

CEQ ID NO. ; 19; 7683 

GGTCGTCTGGGCCATACAAAACATTAAGGATAACAGGGTCGGAGTGATCAACGGATAATTCATTCTGAATGCCAC 
ACTCA 

20 

SfiQ- ID NO. t 20; 7684 

GCTGCTGAATGTTTCCATCAATCAGCCAGGAGTACTGTGCAGGGGGGTTGGATGCTGCATGGCAAGAAAGGCTCA 
AGTTC 

25 CEQ ID NO. ! 21; 7 S#£ 

CGG7VAACAGTTATTGTTTTAACTGTAGTCCTGCTGTGACCACTGGCTGAGTTATTGGCCTGGCAAGTATAGAGTC 
CGCTG 

SSQ- ID NO. ! 22; 7686 
30 GGTGAGGTTCACAGGTGAAGGGGACAGCATCCTTGTCCTCCACGGGT 

SSQ ID NO. I 23; CflA CAffgD 

ATGGAGTCTC CCTCGGCCCC TCCCCACAGA TGGTGCATCC CCTGGCACAG GCTCCTGCTC 

ACACCCTCAC TTCTAACCTT CTGGAACCCG CCCACCACTG CCAACCTCAC TATTGAATCC 

35 ACGCCGTTCA ATGTCGCACA CGGGAAGGAG GTGCTTCTAC TTGTCCACAA TCTGCCCCAG 

CATCTTTTTG GCTACAGCTG GTACAAAGGT GAAAGAGTGG ATGCCAACCG TCAAATTATA 

GGATATGTAA TAGGAACTCA ACAAGCTACC CCAGGCCCCG CATACAGTGG TCGAGACATA 

ATATACCCCA ATGCATCCCT GCTGATCCAG AACATCATCC AGAATGACAC AGCATTCTAC 

ACCCTAGACG TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACTGGCCA GTTCCGGGTA 

40 TACCCGGAGC TGCCCAAGCC CTCCATCTCC ACCAACAACT CCAAACCCGT GGAGGACAAG 

GATGCTGTGG CCTTCACCTG TGAACCTGAG ACTCAGGACG CAACCTACCT GTGGTGGGTA 

AACAATCAGA CCCTCCCGGT CAGTCCCAGG CTGCACCTGT CCAATGGCAA CACCACCCTC 

ACTCTATTCA ATGTCACAAG A AAT G AC AC A GCAAGCTACA AATGTGAAAC CCAGAACCCA 

GTGACTGCCA GGCGCAGTGA TTCAGTCATC CTGAATGTCC TCTATGGCCC GGATGCCCCC 

45 ACCATTTCCC CTCTAAACAC ATCTTACAGA TCAGGGGAAA ATCTGAACCT CTCCTGCCAC 

GCAGCCTCTA ACCCACCTGC ACAGTACTCT TGGTTTCTCA ATCGGACTTT CCAGCAATCC 

ACCCAAGAGC TCTTTATCCC CAACATCAGT GTGAATAATA GTGGATGCTA TACGTGCCAA 

GCCCATAACT CAGACACTGG CCTCAATAGG ACCACACTCA CGACGATCAC AGTCTATGAC 

CCACCCAAAC CCTTCATCAC CAGCAACAAC TCCAACCCCG TGGAGGATGA GGATGCTGTA 

50 GCCTTAACCT GTGAACCTCA GATTCAGAAC ACAAGCTACC TGTGGTGGGT AAATAATCAG 

ACCCTCCCGG TCAGTCCCAG GCTGCAGCTG TCC AAT GAGA ACACGACCCT CAGTCTACTG 

AGTGTCACAA GGAATGATGT AGGACCCTAT GAGTGTGGAA TCCAGAACGA ATTAAGTGTT 

GACCACAGCG ACCCAGTCAT CCTGAATGTC CTCTATGGCC CAGACGACCC CACCATTTCC 

CCCTCATACA CCTATTACCG TCCAGCGGTG AACCTCAGCC TCTCCTGCCA TGCAGCCTCT 

55 AACCCACCTG CACAGTATTC TTGGCTGATT GATGGGAACA TCCAGCAACA CACACAACAG 

CTCTTTATCT CCAACATCAC TGAGAAGAAC AGCGGACTCT ATACCTGCCA CGCCAATAAC 

TCACCCAGTG GCCACAGCAG GACTACACTC AAGACAATCA CAGTCTCTGC GGAGCTGCCC 

AACCCCTCCA TCTCCAGCAA CAACTCCAAA CCCGTGGAGG ACAAGGATGC TGTGGCCTTC 

ACCTGTGAAC GTGAGGCTCA GAACACAACC TACCTGTGGT GGGTAAATGG TCAGACCCTG 

60 CCAGTCAGTC CCAGGCTGCA GCTGTCCAAT GGCAACAGCA CCCTCACTCT ATTCAATCTC 
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ACAAGAAATG 
ACTCACCCAG 


ACGCAAGAGC 
TCACCCTGGA 


CTATGTATGT 
TGTCCTCTAT 


GGAATGCAGA 
GGGCCGGACA 


ACTCAGTGAG 
CCCCCATCAT 


TGCAAACCGC 
TTCCCCCCCA 




AGCTTTCGGG 


AGGGGAGGTC 


AAGGTGTGGT 


GCCACTCGGC 


CTCTAAGCCA 


GACTCGTCTT 
TCCCCGCAGT 


ATTCTTGGCG 


TATCAATGGG 


ATACCGCAGC 


AACACACACA 


AGTTCTCTTT 


ATCGCCAAAA 


TCACGCCAAA 


TAATAACGCG 


ACCTATGCCT 


GTTTTGTCTC 


TAACTTGGCT 


ACTGGCCGCA 


ATAATTCCAT 


AGTCAAGAGC 


ATCACAGTCT 


CTGCATCTGG 


AACTTCTCCT 


GGTGTGTCAG 


CTGGGGCCAC 


TGTCGGCATC 


ATGATTGGAG 


TGCTGGTTGG 


GGTTGGTGTG 



SSQ- ID HO. ! 24; CAD6D <-±t2 

10 ATGGAGTCTC CCTCGGCCCC TCCCCACAGA TGGTGCATGC CCTGGGAGAG GCTGCTGCTG 
ACAGCCTCAC TTCTAACCTT CTCCAACCCG CCCACCACTG CCAAGCTCAC TATTGAATCC 
ACGCCGTTCA ATGTCGCAGA GGGCAAGCAG GTGCTTCTAC TTGTCCACAA TCTGCCCCAG 
CATCTTTTTG GCTACAGCTG GTACAAAGGT GAAAGAGTGG ATGGCAACCG TCAAATTATA 
GGATATGTAA TAGGAACTCA ACAAGCTACC CCAGGGCCGG CATACAGTGG TCGAGAGATA 

15 ATATACCCCA ATCCATCCCT GCTGATCCAG AACATCATCC AGAATGACAC AGGATTCTAC 
ACCCTACACG TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACTGGCCA GTTCCGGGTA 
TACCCCGAAC TCCCTAAGCC TTCTATTAGC TCCAATAATA GTAAGCCTGT CGAAGACAAA 
GATGCGGTCG CTTTTACATG GGAGCCCGAA ACTGAAGAGG GAAGATATGT GTGCTGGGTG 
AACAACCAGT CCCTGCCTCT GTCCCCTAGA CTCCAACTCA CCAACGGAAA TAGAACTCTG 

20 ACCCTGTTTA ACGTGACCAG GAACGACACA GCAAGCTACA AATGCGAAAC CCAAAATCCA 
GTCAGCGCCA GGAGGTCTCA TTCAGTGATT CTCAACGTCC TTTACGGACC CGATGCTCCT 
ACAATCAGCC CTCTAAACAC A AG C TAT AG A TCAGGGGAAA ATCTGAATCT GAGGTGTCAT 
GCCGCTAGCA ATCCTCCCGC CCAATACAGC TGGTTTGTCA ATGGCACTTT CCAACAGTCC 
ACCCAGGAAC TGTTCATTCC CAATATTACC GTGAACAATA CTGCATCCTA CACGTGCCAA 

25 GCTCACAATA GCGACACCGG ACTCAACCGC ACAACCGTGA CGACGATTAC CGTGTATGAG 
CCACCAAAAC CATTCATAAC TACTAACAAT TCTAACCCAG TTGAGGATGA GGAGGGAGTT 
GCATTAACTT GTGAGCCAGA GATTCAAAAT ACCACTTATT TATGGTGGGT CAATAACCAA 
AGTTTGCCCG TTAGCCCACG CTTGCAGTTG TCTAATGATA ACCGCACATT GACACTCCTG 
TCCGTTACTC CCAATGATCT AGCACCTTAT CAGTGTCGCA TTGAGAATGA ATTATCCGTT 

30 GATCAGTCCG ACCCTGTTAT CCTTAATGTT TTGTATGGCC CAGACGACGG AAGTATATGT 
CCATCATACA CCTACTACCG TCCCGGCGTG AACTTGAGCC TTTCTTGCCA TGCAGCATCC 
AACCCCCCTG CACAGTACTC CTCGCTGATT GATGGAAACA TTCAGCAGCA TACTCAAGAG 
TTATTTATAA CCAACATAAC TCAGAACAAC AGCGGACTCT ATACTTGCCA GGCCAATAAC 
TCAGCCAGTG GTCACAGCAG GACTACAGTT AAAACAATAA CTGTTTCCGC GCAGCTGCCC 

35 AAGCCCTCCA TCTCCAGCAA CAACTCCAAA CCCCTGGAGC ACAAGGATGC TGTGGCCTTC 
ACCTGTGAAC CTGAGGCTCA GAACACAACC TACCTCTGGT GGGTAAATGG TCAGAGCCTC 
CCAGTCAGTC CCAGGCTGCA GCTGTCCAAT GGCAACAGGA CCCTCACTCT ATTCAATGTC 
ACAAGAAATG ACGCAAGAGC CTATGTATGT GGAATGCAGA ACTCAGTGAG TGCAAACCGC 
AGTGACCCAG TCACCCTGGA TGTCCTCTAT GGGCCGGACA CCCCCATCAT TTCCCCCCCA 

40 GACTCGTCTT ACCTTTCGGC AGCGGACCTC AACCTCTCCT GCCACTCGGC CTCTAACCCA 
TCCCCGCAGT ATTCTTGGCG TATCAATCGC ATACCGCAGC AACACACACA AGTTCTCTTT 
ATCCCCAAAA TCAGGGCAAA TAATAACGCG ACCTxATGCGT GTTTTGTCTC TAACTTGGCT 
ACTGGCCGCA ATAATTCCAT AGTCAAGAGC ATCACAGTCT CTGCATCTGG AACTTCTCCT 
GGTCTCTCAG CTGGGGCCAC TGTCGGCATC ATGATTGGAG TGCTGGTTGG GGTTGCTCTG ATATAG 

45 

SEQUENCE LISTING 



<110> Sanofi Pasteur, Ltd. 

Therion Biologies, Inc. 

50 

<120> Modified CEA Nucleic Acid and Expression Vectors 



<130> API-01-20-US 



55 <140> 10/510,677 

38 



<141> 2003-04-09 



<150> US 60/370,972 

<151> 2002-04-09 

5 

<160> 32 

<170> Patentln version 3.3 

10 <210> 1 

<211> 3564 

<212> DNA 

<213> Homo sapiens 

15 

<220> 

<221> misc_feature 
<222> (1663) . . (1663) 
<223> n is a, c, g, or t 

20 

<400> 1 

agcaggaccg gggcctgtgt cgctatgggt tcccccgccg ccccggaggg agcgctgggc 60 
tacgtccgcg agttcactcg ccactcctcc gacgtgctgg gcaacctcaa cgagctgcgc 120 

25 

ctgcgcggga tcctcactga cgtcacgctg ctggttggcg ggcaacccct cagagcacac 180 
aaggcagttc tcatcgcctg cagtggcttc ttctattcaa ttttccgggg ccgtgcggga 240 
30 gtcggggtgg acgtgctctc tctgcccggg ggtcccgaag cgagaggctt cgcccctcta 300 
ttggacttca tgtacacttc gcgcctgcgc ctctctccag ccactgcacc agcagtccta 360 
gcggccgcca cctatttgca gatggagcac gtggtccagg catgccaccg cttcatccag 420 

35 

gccagctatg aacctctggg catctccctg cgccccctgg aagcagaacc cccaacaccc 480 

ccaacggccc ctccaccagg tagtcccagg cgctccgaag gacacccaga cccacctact 540 

40 gaatctcgaa gctgcagtca aggccccccc agtccagcca gccctgaccc caaggcctgc 600 

39 



aactggaaaa agtacaagta catcgtgcta 
gtcggggaga gaagttctgg tcaaccttgc 

5 

gcctccagca gcagcagcag cagcagcagc 
ccccagagca ggctctctcc aactgctgcc 
10 agtaccccct acctcctcac atcccaggct 
gctcgtccac taccgggagt gaatttttca 
gctcatcggg ggctggactc cttggttcct 

15 

ctgtgccggt cttcgttccg ctacaagggc 
ggggaaaagc cttaccactg ctcaatctgc 
20 aaaacgcaca gccgcatcca ttcgggagag 
cgctttgtac aggtggcaca tctgcgggcg 
tacccttgcc ctacctgcgg aacccgcttc 

25 

cgcatccaca ccggagagaa gccttaccac 
aagagtcaac tgcggctgca tctgcgccag 
30 cactaccaca ttctcggggg gccctagctg 
cgggtgggaa agctgcaggc ccaggccttg 
gccaggccac tttggtatca gaaattgcca 

35 

ggtggcagat cctggctaga tctgcctctg 
ccagattgtt tctgaggaga gagctagcta 
40 tggtctccct aagggaatag ccctccacct 



aactctcagg cctcccaagc agggagcctg 660 

ccccaagcca ggctccccag tggagacgag 720 

agcagcagtg aagaaggacc cattcctggt 780 

actgtgcagt tcaaatgtgg ggctccagcc 840 

caagacacct ctggatcacc ctctgaacgg 900 

gctgccagaa ctgtgaggct gtggcagggt 960 

ggggacgaag acaaacccta taagtgtcag 1020 

aaccttgcca gtcaccgtac agtgcacaca 1080 

ggagcccgtt ttaaccggcc agcaaacctg 1140 

aagccgtata agtgtgagac gtgcggctcg 1200 

cacgtgctga tccacaccgg ggagaagccc 1260 

cgccacctgc agaccctcaa gagccacgtt 1320 

tgcgacccct gtggcctgca tttccggcac 1380 

aaacacggag ctgctaccaa caccaaagtg 1440 

agcgcaggcc caggccccac ttgcttcctg 1500 

cttccctatc aggcttgggc ataggggtgt 1560 

ccctcttaat ttctcactgg ggagagcagg 1620 

ttttgctggt canaccctct tccccacaag 1680 

ggggctggga aaggggagag attggagtcc 1740 

gtggccccca ttgcattcag tttatctgta 1800 
40 



aaatataatt tattgaggcc tttgggtggc 
ctcccctctt ccacaagtgt gattaaaagt 

5 

tctgctggca gagattacta gcccttggct 
ttctgtcata acttttatct ttagaattgt 
10 gtttaaaatg gaaaaagggg ttctctgtgt 
ttatttgttc tagggcagct ctgggaacat 
ctctggtatt ctggatgttg taggttctct 

15 

ctgttcttca agggtgatag gaaccattat 
gcctcctgga ggctgtgggt gtgggggatt 
20 gaggctgtgg gtgtggggga ttctgtatct 
caggtttttc tgcaagatgg tccagaatct 
ttggctctgc tgtatccatc tatagtggta 

25 

catcctccca cgggggcctg ttcttagtac 
gacattcctt atcagagatg atgtgacctt 
30 atggcctagg gaagaatcat gaaactcttt 
catcctttac tacagaggca tatgggtttg 
gagccccttc ttcctttagt gggttttgga 

35 

ccttcttttc ctctagaagg gatggtgctt 
tgacctggtc ttcccatccc tgcattcctg 
40 cataggaaaa gaaaaggggc tgagttccat 



accggggcct tcattcgatt gcatttccca 1860 

gaccagaaac acagaaggtg agatcacagc 1920 

ctctcgtttg gcttgggtat tttatattat 1980 

tctttctcct gtttgtttgc ttgttagttt 2040 

tctgcccctg taattctagg tctggaacct 2100 

gcgggattgt ggaattgggt caggaaccct 2160 

agcagtctag aaatggatac agacatttct 2220 

gttgagccca aaatggaagt aataataaat 2280 

ctgtatctgg attccgtatc actccaactg 2340 

ggattccgta tcactccaag tggaggctgg 2400 

aaaatgtccc attaatctgg tcacttgggt 2460 

gagacccacc agggctcaag tggagtccat 2520 

tgagttgatc gctccatggg ggagagatca 2580 

ttctgactct gcccagtctc tatgaatgtt 2640 

agcttgatta gatggtaaac agtgttaacc 2700 

aatgttacct ggggttctct ctattgagtt 2760 

catcttctgg caagtgtcca gatgccagaa 2820 

ggtaacctta ccttttaaaa gctgggtctg 2880 

tctggaacca gtgaatgcat tagaaccttc 2940 

tctgggtttg ctgtagtttg gttgggatta 3000 
41 



ttgttggcat tacagatgta aaagattgac 
gaccaagttt caagtaggat taagaggttg 

5 

ggtaggtaga aagtgaggaa cagggttgcc 
gaagaagcgc tgaagccttg attgatagtt 
10 ttatgggacg agggtagaaa gtaagaagca 
ataagtcagt tctagtggct gtcgcctggg 
tccctctttc tccccatggc cccactgcag 

15 

agtctataag aaggaatcat gatttctatt 
ctgggggtag aaatgttaga tcttgcaaca 
20 cgcaaaaaaa aaaaaaaaaa aaaa 



<210> 2 

<211> 1440 

25 <212> DNA 

<213> Homo sapiens 

<400> 2 

atgggttccc ccgccgcccc ggagggagcg 

30 

tcctccgacg tgctgggcaa cctcaacgag 

acgctgctgg ttggcgggca acccctcaga 

35 ggcttcttct attcaatttt ccggggccgt 

cccgggggtc ccgaagcgag aggcttcgcc 

ctgcgcctct ctccagccac tgcaccagca 

40 



tagcccatag gccaaaggcc tgttctagtt 3060 

gttgaggggt gcagtttctg gtgtaggcca 3120 

tcttggctgg gtggagtctc tgaaatgtta 3180 

ctgccccttg ttgccctggg gcttatctga 3240 

cttttgaatt tgtggggtag aacttcaaca 3300 

gactagtgag aaagctactc ttctccctct 3360 

aattaaagaa ggaagaaggg aaggcggagg 3420 

tagcagattg gatgggcagg tggagaatgc 3480 

tcagatcctt ggaataaaga agcctctctg 3540 

3564 



ctgggctacg tccgcgagtt cactcgccac 60 

ctgcgcctgc gcgggatcct cactgacgtc 120 

gcacacaagg cagttctcat cgcctgcagt 180 

gcgggagtcg gggtggacgt gctctctctg 240 

cctctattgg acttcatgta cacttcgcgc 300 

gtcctagcgg ccgccaccta tttgcagatg 360 

42 



gagcacgtgg tccaggcatg ccaccgcttc 
tccctgcgcc ccctggaagc agaaccccca 
5 cccaggcgct ccgaaggaca cccagaccca 
ccccccagtc cagccagccc tgaccccaag 
gtgctaaact ctcaggcctc ccaagcaggg 

10 

ccttgccccc aagccaggct ccccagtgga 
agcagcagca gtgaagaagg acccattcct 
15 gccactgtgc agttcaaatg tggggctcca 
gctcaagaca cctctggatc accctctgaa 
ttcagctgcc agaactgtga ggctgtggca 

20 

cctggggacg aagacaaacc ctataagtgt 
ggcaaccttg ccagtcatcg tacagtgcac 
25 tgcggagccc gttttaaccg gccagcaaac 
gagaagccgt ataagtgtga gacgtgcggc 
gcgcacgtgc tgatccacac cggggagaag 

30 

ttccgccacc tgcagaccct caagagccac 
cactgcgacc cctgtggcct gcatttccgg 
35 cagaaacacg gagctgctac caacaccaaa 

<210> 3 
<211> 65 
40 <212> DNA 



atccaggcca gctatgaacc tctgggcatc 420 

acacccccaa cggcccctcc accaggtagt 480 

cctactgaat ctcgaagctg cagtcaaggc 540 

gcctgcaact ggaaaaagta caagtacatc 600 

agcctggtcg gggagagaag ttctggtcaa 660 

gacgaggcct ccagcagcag cagcagcagc 720 

ggtccccaga gcaggctctc tccaactgct 780 

gccagtaccc cctacctcct cacatcccag 840 

cgggctcgtc cactaccggg aagtgaattt 900 

gggtgctcat cggggctgga ctccttggtt 960 

cagctgtgcc ggtcttcgtt ccgctacaag 1020 

acaggggaaa agccttacca ctgctcaatc 1080 

ctgaaaacgc acagccgcat ccattcggga 1140 

tcgcgctttg tacaggtggc acatctgcgg 1200 

ccctaccctt gccctacctg cggaacccgc 1260 

gttcgcatcc acaccggaga gaagccttac 1320 

cacaagagtc aactgcggct gcatctgcgc 1380 

gtgcactacc acattctcgg ggggccctag 1440 

43 



<213> Homo sapiens 
<400> 3 

atacccggaa ctccctaagc cttctattag ctccaataat agtaagcctg tcgaagacaa 60 

5 

agatg 65 



<210> 4 

10 <211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 4 

15 gcctgtgtcc cctagactcc aactcagcaa cggaaataga actctgaccc tgtttaacgt 60 
gaccaggaac 70 



20 <210> 5 

<211> 70 

<212> DNA 

<213> Homo sapiens 

25 <400> 5 

acgtgcttta cggacccgat gctcctacaa tcagccctct aaacacaagc tatagatcag 60 
gggaaaatct 70 



30 



35 



<210> 6 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 6 

acgttaaaca gggtcagagt tctatttccg ttgctgagtt ggagtctagg ggacacaggc 60 



agggactggt 

40 



44 



70 



<210> 7 

<211> 70 

<212> DNA 

5 <213> Homo sapiens 

<400> 7 

ctgatctata gcttgtgttt agagggctga ttgtaggagc atcgggtccg taaagcacgt 60 
10 tgagaatcac 70 



<210> 8 

<211> 63 

15 <212> DNA 

<213> Homo sapiens 



<400> 8 

gatccactat tgttcacggt aatattggga atgaacagtt cctgggtgga ctgttggaaa 60 

20 

gtg 63 



<210> 9 

25 <211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 9 

30 gacacagcaa gctacaaatg cgaaacccaa aatccagtca gcgccaggag gtctgattca 60 
gtgattctca 70 



35 <210> 10 

<211> 70 

<212> DNA 

<213> Homo sapiens 



40 <400> 10 



45 



tgaatcagac ctcctggcgc tgactggatt ttgggtttcg catttgtagc ttgctgtgtc 60 



10 



15 



gttcctggtc 70 



<210> 11 

<211> 79 

<212> DNA 

<213> Homo sapiens 

<400> 11 

gatcctacac gtgccaagct cacaatagcg acaccggact caaccgcaca accgtgacga 60 
cgattaccgt gtatgccga 79 



<210> 12 

<211> 70 

<212> DNA 

20 <213> Homo sapiens 

<400> 12 

catcctcaac tgggttagaa ttgttactag ttatgaatgg ttttggtggc tcggcataca 60 
25 cggtaatcgt 70 



<210> 13 

<211> 80 

30 <212> DNA 

<213> Homo sapiens 



<400> 13 

ttctaaccca gttgaggatg aggacgcagt tgcattaact tgtgagccag agattcaaaa 60 

35 

taccacttat ttatggtggg 80 



<210> 14 
40 <211> 80 



46 



<212> DNA 

<213> Homo sapiens 

<400> 14 

5 gtctaatgat aaccgcacat tgacactcct gtccgttact cgcaatgatg taggacctta 60 
tgagtgtggc attcagaatg 80 



10 <210> 15 

<211> 80 

<212> DNA 

<213> Homo sapiens 

15 <400> 15 

tttgtatggc ccagacgacc caactatatc tccatcatac acctactacc gtcccggcgt 60 
gaacttgagc ctttcttgcc 80 



20 



25 



30 



<210> 16 

<211> 80 

<212> DNA 

<213> Homo sapiens 

<400> 16 

tgatggaaac attcagcagc atactcaaga gttatttata agcaacataa ctgagaagaa 60 
cagcggactc tatacttgcc 80 



<210> 17 

<211> 80 

<212> DNA 

35 <213> Homo sapiens 

<400> 17 

taaaacaata actgtttccg cggagctgcc caagccctcc atctccagca acaactccaa 60 



40 acccgtggag gacaaggatg 



47 



80 



<210> 18 

<211> 80 

5 <212> DNA 

<213> Homo sapiens 

<400> 18 

atgtgcggtt atcattagac aactgcaagc gtgggctaac cggcaaactt tggttattga 60 

10 

cccaccataa ataagtggta 80 



<210> 19 

15 <211> 80 

<212> DNA 

<213> Homo sapiens 

<400> 19 

20 ggtcgtctgg gccatacaaa acattaagga taacagggtc ggagtgatca acggataatt 60 
cattctgaat gccacactca 80 



25 <210> 20 

<211> 80 

<212> DNA 

<213> Homo sapiens 

30 <400> 20 

gctgctgaat gtttccatca atcagccagg agtactgtgc aggggggttg gatgctgcat 60 
ggcaagaaag gctcaagttc 80 

35 

<210> 21 

<211> 80 

<212> DNA 

<213> Homo sapiens 

40 



48 



<400> 21 

cggaaacagt tattgtttta actgtagtcc tgctgtgacc actggctgag ttattggcct 60 
ggcaagtata gagtccgctg 80 

5 

<210> 22 
<211> 47 
<212> DNA 
10 <213> Homo sapiens 

<400> 22 

cctcaggttc acaggtgaag gccacagcat ccttgtcctc cacgggt 47 

15 

<210> 23 

<211> 2106 

<212> DNA 

<213> Homo sapiens 

20 

<400> 23 

atggagtctc cctcggcccc tccccacaga tggtgcatcc cctggcagag gctcctgctc 60 
acagcctcac ttctaacctt ctggaacccg cccaccactg ccaagctcac tattgaatcc 120 

25 

acgccgttca atgtcgcaga ggggaaggag gtgcttctac ttgtccacaa tctgccccag 180 
catctttttg gctacagctg gtacaaaggt gaaagagtgg atggcaaccg tcaaattata 240 
30 ggatatgtaa taggaactca acaagctacc ccagggcccg catacagtgg tcgagagata 300 
atatacccca atgcatccct gctgatccag aacatcatcc agaatgacac aggattctac 360 
accctacacg tcataaagtc agatcttgtg aatgaagaag caactggcca gttccgggta 420 

35 

tacccggagc tgcccaagcc ctccatctcc agcaacaact ccaaacccgt ggaggacaag 480 

gatgctgtgg ccttcacctg tgaacctgag actcaggacg caacctacct gtggtgggta 540 

40 aacaatcaga gcctcccggt cagtcccagg ctgcagctgt ccaatggcaa caggaccctc 600 

49 



actctattca atgtcacaag aaatgacaca 
gtgagtgcca ggcgcagtga ttcagtcatc 

5 

accatttccc ctctaaacac atcttacaga 
gcagcctcta acccacctgc acagtactct 
10 acccaagagc tctttatccc caacatcact 
gcccataact cagacactgg cctcaatagg 
ccacccaaac ccttcatcac cagcaacaac 

15 

gccttaacct gtgaacctga gattcagaac 
agcctcccgg tcagtcccag gctgcagctg 
20 agtgtcacaa ggaatgatgt aggaccctat 
gaccacagcg acccagtcat cctgaatgtc 
ccctcataca cctattaccg tccaggggtg 

25 

aacccacctg cacagtattc ttggctgatt 
ctctttatct ccaacatcac tgagaagaac 
30 tcagccagtg gccacagcag gactacagtc 
aagccctcca tctccagcaa caactccaaa 
acctgtgaac ctgaggctca gaacacaacc 

35 

ccagtcagtc ccaggctgca gctgtccaat 
acaagaaatg acgcaagagc ctatgtatgt 
40 agtgacccag tcaccctgga tgtcctctat 



gcaagctaca aatgtgaaac ccagaaccca 660 

ctgaatgtcc tctatggccc ggatgccccc 720 

tcaggggaaa atctgaacct ctcctgccac 780 

tggtttgtca atgggacttt ccagcaatcc 840 

gtgaataata gtggatccta tacgtgccaa 900 

accacagtca cgacgatcac agtctatgag 960 

tccaaccccg tggaggatga ggatgctgta 1020 

acaacctacc tgtggtgggt aaataatcag 1080 

tccaatgaca acaggaccct cactctactc 1140 

gagtgtggaa tccagaacga attaagtgtt 1200 

ctctatggcc cagacgaccc caccatttcc 1260 

aacctcagcc tctcctgcca tgcagcctct 1320 

gatgggaaca tccagcaaca cacacaagag 1380 

agcggactct atacctgcca ggccaataac 1440 

aagacaatca cagtctctgc ggagctgccc 1500 

cccgtggagg acaaggatgc tgtggccttc 1560 

tacctgtggt gggtaaatgg tcagagcctc 1620 

ggcaacagga ccctcactct attcaatgtc 1680 

ggaatccaga actcagtgag tgcaaaccgc 1740 

gggccggaca cccccatcat ttccccccca 1800 
50 



gactcgtctt acctttcggg agcggacctc aacctctcct gccactcggc ctctaaccca 1860 



tccccgcagt attcttggcg tatcaatggg ataccgcagc aacacacaca agttctcttt 1920 

5 

atcgccaaaa tcacgccaaa taataacggg acctatgcct gttttgtctc taacttggct 1980 

actggccgca ataattccat agtcaagagc atcacagtct ctgcatctgg aacttctcct 2040 

10 ggtctctcag ctggggccac tgtcggcatc atgattggag tgctggttgg ggttgctctg 2100 

atatag 2106 

15 <210> 24 

<211> 47 

<212> DNA 

<213> synthetic 

20 <400> 24 

ggacggtagt aggtgtatga tggagatata gttgggtcgt ctgggcc 47 

<210> 25 

25 <211> 27 

<212> DNA 

<213> Synthetic 

<400> 25 

30 cagaatgaat tatccgttga tcactcc 27 



<210> 26 

<211> 45 

35 <212> DNA 

<213> Synthetic 

<400> 26 

cgtgacgacg attaccgtgt atgagccacc aaaaccattc ataac 45 

40 

51 



<210> 27 

<211> 45 

<212> DNA 

5 <213> Synthetic 



<400> 27 

gttatgaatg gttttggtgg ctcatacacg gtaatcgtcg tcacg 45 

10 

<210> 28 
<211> 2106 
<212> DNA 
<213> Synthetic 

15 

<4O0> 28 

atggagtctc cctcggcccc tccccacaga tggtgcatcc cctggcagag gctcctgctc 60 
acagcctcac ttctaacctt ctggaacccg cccaccactg ccaagctcac tattgaatcc 120 

20 

acgccgttca atgtcgcaga ggggaaggag gtgcttctac ttgtccacaa tctgccccag 180 
catctttttg gctacagctg gtacaaaggt gaaagagtgg atggcaaccg tcaaattata 240 
25 ggatatgtaa taggaactca acaagctacc ccagggcccg catacagtgg tcgagagata 300 
atatacccca atgcatccct gctgatccag aacatcatcc agaatgacac aggattctac 360 
accctacacg tcataaagtc agatcttgtg aatgaagaag caactggcca gttccgggta 420 

30 

tacccggaac tccctaagcc ttctattagc tccaataata gtaagcctgt cgaagacaaa 480 
gatgccgtcg cttttacatg cgagcccgaa actcaagacg caacatatct ctggtgggtg 540 
35 aacaaccagt ccctgcctgt gtcccctaga ctccaactca gcaacggaaa tagaactctg 600 
accctgttta acgtgaccag gaacgacaca gcaagctaca aatgcgaaac ccaaaatcca 660 
gtcagcgcca ggaggtctga ttcagtgatt ctcaacgtgc tttacggacc cgatgctcct 720 

40 

52 



acaatcagcc 
gccgctagca 
5 acccaggaac 
gctcacaata 
ccaccaaaac 

10 

gcattaactt 
agtttgccgg 
15 tccgttactc 
gatcactccg 
ccatcataca 

20 

aacccccctg 
ttatttataa 
25 tcagccagtg 
aagccctcca 
acctgtgaac 

30 

ccagtcagtc 
acaagaaatg 
35 agtgacccag 
gactcgtctt 
tccccgcagt 



ctctaaacac aagctataga tcaggggaaa 
atcctcccgc ccaatacagc tggtttgtca 
tgttcattcc caatattacc gtgaacaata 
gcgacaccgg actcaaccgc acaaccgtga 
cattcataac tagtaacaat tctaacccag 
gtgagccaga gattcaaaat accacttatt 
ttagcccacg cttgcagttg tctaatgata 
gcaatgatgt aggaccttat gagtgtggca 
accctgttat ccttaatgtt ttgtatggcc 
cctactaccg tcccggcgtg aacttgagcc 
cacagtactc ctggctgatt gatggaaaca 
gcaacataac tgagaagaac agcggactct 
gtcacagcag gactacagtt aaaacaataa 
tctccagcaa caactccaaa cccgtggagg 
ctgaggctca gaacacaacc tacctgtggt 
ccaggctgca gctgtccaat ggcaacagga 
acgcaagagc ctatgtatgt ggaatccaga 
tcaccctgga tgtcctctat gggccggaca 
acctttcggg agcggacctc aacctctcct 
attcttggcg tatcaatggg ataccgcagc 



atctgaatct gagctgtcat 780 

atggcacttt ccaacagtcc 840 

gtggatccta cacgtgccaa 900 

cgacgattac cgtgtatgag 960 

ttgaggatga ggacgcagtt 1020 

tatggtgggt caataaccaa 1080 

accgcacatt gacactcctg 1140 

ttcagaatga attatccgtt 1200 

cagacgaccc aactatatct 1260 

tttcttgcca tgcagcatcc 1320 

ttcagcagca tactcaagag 1380 

atacttgcca ggccaataac 1440 

ctgtttccgc ggagctgccc 1500 

acaaggatgc tgtggccttc 1560 

gggtaaatgg tcagagcctc 1620 

ccctcactct attcaatgtc 1680 

actcagtgag tgcaaaccgc 1740 

cccccatcat ttccccccca 1800 

gccactcggc ctctaaccca 1860 

aacacacaca agttctcttt 1920 



53 



atcgccaaaa tcacgccaaa taataacggg acctatgcct gttttgtctc taacttggct 1980 

actggccgca ataattccat agtcaagagc atcacagtct ctgcatctgg aacttctcct 2040 

5 ggtctctcag ctggggccac tgtcggcatc atgattggag tgctggttgg ggttgctctg 2100 

atatag 2106 



10 <210> 29 

<211> 35 

<212> DNA 

<213> Synthetic 

15 <400> 29 

ctggcgcgcc ttctttattc tatacttaaa aagtg 35 



<210> 30 

20 <211> 36 

<212> DNA 

<213> Synthetic 

<400> 30 

25 ctggtaccag aaaaactata tcagagcaac cccaac 36 



<210> 31 

<211> 26 

30 <212> DNA 

<213> Synthetic 



<400> 31 

ttggatccat ggagtctccc tcggcc 26 

35 



<210> 32 

<211> 26 

<212> DNA 

40 <213> Synthetic 



54 



<400> 32 

ttggatccct atatcagagc aacccc 



